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Seedyl sequence for making plants having changed growth 

characteristics 



The present invention concerns a method for modifying growth characteristics of a plant. More 
5 specifically, the present invention concerns a method for modifying growth characteristics by 
modified expression of a seedyl nucleic acid and/or by modified levels and/or activity of a 
seedyl protein in a plant. The present invention also concerns plants having modified 
expression of a seedyl nucleic acid and/or modified levels and/or activity of a seedyl protein, 
which plants have modified growth characteristics relative to corresponding wild type plants. 

10 

The ever-increasing world population and the dwindling supply of arable land available for 
agriculture fuel research towards improving the efficiency of agriculture. Conventional means 
for crop and horticultural improvements utilise selective breeding techniques to identify plants 
having desirable characteristics. However, such selective breeding techniques have several 

1 5 drawbacks, namely that these techniques are typically labour intensive and result in plants that 
often contain heterogeneous genetic components that may not always result in the desirable 
trait being passed on from parent plants. Advances in molecular biology have allowed mankind 
to modify the germplasm of animals and plants. Genetic engineering of plants entails the 
isolation and manipulation of genetic material (typically in the form of DNA or RNA) and the 

20 subsequent introduction of that genetic material into a plant. Such technology has the capacity 
to deliver crops or plants having various improved economic, agronomic or horticultural traits. 
A trait of particular economic interest is yield. Yield is normally defined as the measurable 
produce of economic value from a crop. This may be defined in terms of quantity and/or 
quality. Crop yield may not only be increased by combating one or more of stresses to which a 

25 crop or plant is typically subjected, but may also be increased by modifying the inherent growth 
characteristics of a plant. Yield is directly dependent on several growth characteristics, for 
example, the growth rate, the biomass production, plant architecture, number and size of the 
organs, (for example, the number of branches, tillers, shoots, flowers), seed production and 
more. 

30 

The ability to influence one or more of the abovementioned growth characteristics, would have 
many applications in areas such as crop enhancement, plant breeding, production of 
ornamental plants, aboriculture, horticulture, forestry, production of algae or plants (for 
example for use as bioreactors, for the production of substances such as pharmaceuticals, 

35 antibodies, or vaccines, or for the bioconversion of organic waste or for use as fuel in the case 
of high-yielding algae and plants). 
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It has now been found that modification in a plant of the expression of a seedy 1 nucleic acid 
and/or modification of the level and/or activity levels in a plant of a seedy 1 protein, gives rise to 
plants having modified growth characteristics. It has been shown that introduction of a seedyl 
nucleic acid effects an increase in above ground biomass, increased total seed weight, increased 
5 number of seeds and increased number of tillers. It has been found that the hitherto unknown 
protein seedyl, is a protein having a coiled coil domain and three conserved motifs as presented 
in SEQ ID NO's 15, 16 and 17. 

Therefore, the present invention provides a method for modifying growth characteristics of a 
10 plant, comprising modifying in a plant expression of a nucleic acid encoding a seedyl protein 
and/or modifying in a plant level and/or activity of a seedyl protein, wherein said seedyl 
protein comprises in the following order from N-terminus to C-terminus at least two motifs, 
which motifs are: 

(i) a motif having at least 80% sequence identity to the sequence represented by 
15 SEQ ID NO 15; and/or 

(ii) a motif having at least 80% sequence identity to the sequence represented by 
SEQ ID NO 16, and/or 

(iii) a motif having at least 80% sequence identity to the sequence represented by 
SEQ ID NO 17 and which is a coiled coil motif; and/or 

20 Ov) a motif having at least 80% sequence identity to the sequence represented by 

SEQ ID NO 18. 

and wherein said growth characteristics are modified relative to the corresponding wild- 
type plants. 

25 Modifying expression of a seedyl nucleic acid and/or modifying of the activity and/or levels of 
a seedyl protein encompasses modifying expression of a gene and/or modifying acti vity 
and/or levels of a gene product, namely a polypeptide, in specific cells or tissues. The term 
"modifying" as used herein means increasing, decreasing or changing in time or place. The 
modified expression, activity and/or levels of a seedyl gene or protein are modified compared 

30 to expression, activity and/or levels of a seedyl gene or protein in corresponding wild-type 
plants. The modified gene expression may result from modified expression levels of an 
endogenous seedyl gene and/or may result from modified expression levels of a seedyl gene 
previously introduced into a plant. Similarly, modified levels and/or activity of a seedyl protein 
may be due to modified expression of an endogenous seedyl nucleic acid/gene and/or due to 

35 modifyed expression of a seedyl nucleic acid/gene previously introduced into a plant. Modified 
expression of a gene/nucleic acid and/or modifying activity and/or levels of a gene 
product/protein may be effected, for example, by chemical means and/or recombinant means. 
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Therefore there is provided by the present invention, a method for modifying growth 
characteristics of a plant, comprising modifying seedyl gene expression and/or modifying 
seedyl protein levels and/or seedyl protein activity, which modification may be effected by 
5 recombinant means and/or by chemical means. 

Advantageously, modified expression of a seedyl nucleic acid and/or modified activity and/or 
levels of a seedyl protein may be effected by chemical means, i.e. by exogenous application 
of one or more compounds or elements capable of modifying expression of a seedyl nucleic 

10 acid and/or capable of modifying activity and/or levels of the seedyl protein. The term 
"exogenous application" as defined herein is taken to mean the contacting or administering of 
a suitable compound or element to a plant (as defined herein below, the term "plant" includes 
plant cell, tissue, organ or to the whole organism). The compound or element may be 
exogenously applied to a plant in a form suitable for plant uptake (such as through application 

15 to the soil for uptake via the roots, or in the case of some plants by applying directly to the 
leaves, for example by spraying). The exogenous application may take place on wild-type 
plants or on transgenic plants that have previously been transformed with a seedyl nucleic 
acid/gene or another transgene. 

20 Suitable compounds or elements include seedyl proteins or seedyl nucleic acids. 
Alternatively, exogenous application of compounds or elements capable of modifying levels of 
factors that directly or indirectly activate or inactivate a seedyl protein will also be suitable in 
practising the invention. Also included are antibodies that can recognise or mimic the function 
of seedyl proteins. Such antibodies may comprise "plantibodies", single chain antibodies, IgG 

25 antibodies and heavy chain camel antibodies, as well as fragments thereof. Additionally or 
alternatively, the resultant effect may also be achieved by the exogenous application of an 
interacting protein or activator or an inhibitor of the seedyl gene/gene product. Additionally or 
alternatively, the compound or element may be a mutagenic substance, such as a chemical 
selected from any one or more of: N-nitroso-N-ethylurea, ethylene imine, ethyl 

30 methanesulphonate and diethyl sulphate. Mutagenesis may also be achieved by exposure to 
ionising radiation, such as X-rays or gamma-rays or ultraviolet light. Methods for introducing 
mutations and for testing the effect of mutations (such as by monitoring gene expression 
and/or protein activity) are well known in the art 

35 Therefore, according to one aspect of the present invention, there is provided a method for 
modifying growth characteristics of a plant, comprising exogenous application of one or more 
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compounds or elements capable of modifying expression of a nucleic acid encoding a seedyl 
protein and/or capable of modifying activity and/or levels of a seedyl protein. 

Additionally or alternatively, and according to a preferred embodiment of the present invention, 
modification of expression of a seedyl nucleic acid and/or modification of activity and/or levels 
of a seedyl protein may be effected by recombinant means. Such recombinant means may 
comprise a direct and/or indirect approach for modification of expression of a nucleic acid 
encoding a seedy 1 and/or for modification of the activity and/or levels of a seedyl protein. 

The nucleic acid encoding a seedy one protein or the seedyl protein as mentioned above may 
be wild type, i.e. a native or endogenous nucleic acid or protein. Alternatively, it may be a 
nucleic acid derived from the same or another species, which nucleic acid is introduced as a 
transgene, for example by transformation. This transgene may be substantially changed from 
its native form in composition and/or genomic environment through deliberate human 
manipulation. 

An indirect recombinant approach may comprise for example introducing, into a plant, a 
nucleic acid capable of modifying expression of the gene in question (a seedyl gene) and/or 
capable of modifying activity and/or levels of the protein in question (a seedyl protein). 
Examples of such nucleic acids to be introduced into a plant are nucleic acids encoding 
transcription factors or activators or inhibitors that bind to the promoter of a seedyl gene or 
that interact with a seedyl protein. Methods to test these types of interactions and methods for 
isolating nucleic acids encoding such interactors include yeast one-hybrid or a yeast two- 
hybrid screens wherein the seedyl gene/protein is used as a bait. 

Also encompassed by an indirect approach for modifying expression of a seedyl gene and/or 
activity and/or levels of a seedyl protein, is the provision of, or the inhibition or stimulation of 
regulatory sequences that drive expression of the native seedyl gene or of the seedyl 
transgene. Such regulatory sequences may be introduced into a plant. For example, the 
nucleic acid introduced into the plant is a promoter, capable of driving the expression of an 
endogenous seedyl gene. 

A further indirect approach for modifying expression of a seedyl gene and/or for modifying 
activity and/or levels of a seedyl protein in a plant, encompasses modified levels in a plant of 
a factor able to interact with seedyl. Such factors may include ligands of seedyl . Therefore, 
the present invention provides a method for modifying growth characteristics of a plant, 
comprising modifying expression of a gene coding for a protein which is a natural ligand of a 
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seedyl. Furthermore, the present invention also provides a method for modifying grow 
characteristics of a plant, comprising modifying expression of a gene coding for a protein whk 
is a natural target/substrate of a seedyl. 

5 A direct and more preferred approach for modifying growth characteristics of a plar 
comprises introducing into a plant a seedyl nucleic acid, or a portion thereof or sequence 
capable of hybridising therewith, which nucleic acid preferably encodes a seedyl protein or 
homologue, derivative or active fragment thereof. The nucleic acid may be introduced into 
plant by, for example, transformation. 

10 

Accordingly, the present invention provides a method for modifying growth characteristics of 
plant, comprising introducing into a plant a nucleic acid capable of modifying expression of 
nucleic acid encoding a seedyl protein and/or capable of modifying activity and/or levels of 
seedyl protein. Further preferably such nucleic acid is a seedyl nucleic acid. 

15 

As mentioned above the nucleic acid to be used in the methods of the present invention ce 
be wild type (native or endogenous). Alternatively, the nucleic acid may be derived firoi 
another species, which gene is introduced into the plant as a transgene, for example t 
transformation. The nucleic acid may thus be derived (either directly or indirectly ■ 

20 subsequently modified)) from any source provided that the nucleic acid, when expressed in 
plant, leads to modified expression of a seedyl nucleic acid/gene or modified activity and/c 
levels of a seedyl protein. The nucleic acid may be isolated from a microbial source, such a 
bacteria, yeast or fungi, or from a plant, algae, insect, or animal (including human) souro 
Preferably, the seedyl nucleic acid is isolated from a plant This nucleic acid may b 

25 substantially changed from its native form in composition and/or genomic environment throug 
deliberate human manipulation. The nucleic acid may be isolated from a dicotyledonoi 
species, preferably from the family Solanaceae, further preferably from Nicotians. Mor 
preferably, the nucleic acid is as represented by SEQ ID NO: 1 or a portion thereof or a nuclei 
acid capable of hybridising therewith or is a nucleic acid encoding an amino acid represente 

30 by SEQ ID NO: 2 or a homologue derivative on active fragment thereof, such as a homologu 
having at least 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80°/ 
85%, 90%, 95% 98%, 99% sequence identity with SEQ ID NO 2. 

Advantageously, the methods according to the invention may also be practised using variar 
35 nucleic acids and variant amino acids of SEQ ID NO 1 or 2 respectively, such as the variant 
further defined hereinafter. 
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The term a seedy1 gene" or tt seedy1 nucleic acid" or "nucleic acid encoding a seedyl protein" 
are used herein interchangeably. 

Taken in a broad context, the term "seedy" protein/nucleic acid also encompasses variant 
nucleic acids and variant amino acids suitable for practicing the methods according to the 
5 invention. Preferably, variant nucleic acids and variant amino acids suitable for practicing the 
methods according to the invention include those falling within the definition of a "seedyl" 
meaning a encoding a protein ore being a protein comprising in the following order from N- 
terminus to C-terminus at least two motifs, which motifs are: 

(i) a motif having at least 80% sequence identity to the sequence represented by 
10 SEQ ID NO 15; and/or 

(ii) a motif having at least 80% sequence identity to the sequence represented by 
SEQ ID NO 16, and/or 

(Hi) a motif having at least 80% sequence identity to the sequence represented by 
SEQ ID NO 1 7 and which is a coiled coil motif; and/or 
IS (iv) a motif having at least 80% sequence identity to the sequence represented by 

SEQ ID NO 18. 

For determining the presence of these conserved motifs sequence can be aligned with the 
software such as for example Align X or clustal X, for indication of the conserved residues (see 

20 for example Figure 3). Software packages like MEME version 3.0 can also be used to 
determine motifs in sequences. This software is available from UCSD, SDSC and NBCR at 
http://meme.sdsc.edu/meme/. For the identification of a coiled coil domain, the software Coils 
2.0 can be used. This software is available at 
http://www.ch.embnet.oro/software/COlLS form.html . For the presentation of the motifs in 

25 SEQ Id NO 15, 16, 17 and 18, X represents any amino acid. 

According to a particular embodiment a seedy 1 protein has at least two, preferably three, 
most preferably four of the above defined domains. 

30 According to a particular embodiment of the invention, a motif having at least 80% sequence 
identity to the sequence represented by SEQ ID NO 15 is the core sequence WXNAXXD as 
represented in SEQ ID NO 15, or at least part of the sequence 
(P/X)X((V/L/H)(Q/H)(V/I)W(N/X)NA(A/P)(F/C)D wherein 
(P/X) preferably is P or is A or T or Q or another amino acid 

35 (V/L/H) preferable is V or L or H 
(Q/H) is either Q or H 

(V/l) is either V or is T or S or another amino acid 

6 
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(A/P) is preferable A or is P 
(F/C) is preferably F or is C 

Alternatively or additionally, according to one embodiment a motif having at least 80% 
sequence identity to the sequence represented by SEQ ID NO 16, is the core sequence 
KENXXP as represented in SEQ ID NO 16. 

Alternatively or additionally, according to one embodiment a motif having at least 80% 
sequence identity to the sequence represented by SEQ ID NO 17 and which is a coiled coil 
motif is the core sequence EX^EXXRLXXXLXXLR as represented in SEQ ID NO 17, or is at 

least P ar t of the sequence 

(l/V/A)(D/E)XE(l/M)XX(IA/)(E/Q)XE(l^^ 
(l/V/A) preferable is I or V or is A 
(D/E) is either D or E 
IS (l/M) preferably is I or is M 
(l/V) preferably is I or is V 
(E/Q) preferably is E or is Q 

(l/X) preferably is I or is M or is V or any other amino acid 
(S/X) preferably S or is T or any other amino acid 
20 (S/X) preferably is S or isT or L or I or A 
(R/K) preferably is R or is K 
(UV/T/I) preferable is L or T or V or I 
(K/Q) preferably is K or Q 

25 Alternatively or additionally, according to one embodiment a motif having at least 80% 
sequence identity to the sequence represented by SEQ ID NO 18, is the core sequence 
LPXIX^oRDSGXXKRX^K as represented in SEQ ID NO 18, or is at least part of the 
sequence 

LP(R/K)I(F^)(T/I)(M/X)(P^^ 
30 /X)(G/A)K, wherein 

(R/K) is either R or K 

(R/X) is preferably R or is S or K 

(T/l) is preferably T or I 

(M/X) is preferably M or L or A or V 
35 (P/R) is either P or R 

(D/X) is preferably D or is G or T or N 

(E/G) is preferably E or is G 
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(S/T) is preferably S or is T 
(P/L) is preferably P or is L 
(C/X) is preferably C or is P or A 
(A/X) is preferably A or is V or I 
(A/I) is preferably A or is I 
(D/E) is either D or E 
(L/R) is preferably L or is R 
(V/X) is preferably V or is Q or N or I 
(G/A) is preferably G or is A 

According to a further embodiment the nucleic acid has a motif which has at least 80% 
sequence identity to the sequence RDSGXXKRXi^K. 

Examples of such seedy 1 proteins are monocots seedyl proteins such as represented by SEQ 
ID NO 4 (rice), SEQ ID NO 8 (sugar cane) and SEQ ID NO 10 (maize) or from dicots SEQ ID 
NO 2 (tobacco), SEQ ID NO 6 (medicago) or SEQ ID NO 12 (Arabidopsis). The proteins as 
presented in SEQ ID NO 8 (sugar cane) and SEQ ID NO 10 (Maize) are only partial, but the 
corresponding full length sequences of the proteins and encoding cDNA, are well within the 
reach of a person skilled in the art who is now able to perform colony hybridization of a cDNA 
library, or PCR based on the use of specific primers combined with degenerated primers. 

Suitable variant nucleic acid and amino acid sequences useful in practising the method 
according to the invention, include: 

(i) Functional portions of a seedyl nucleic acid/gene; 

(ii) Sequences capable of hybridising with a seedyl nucleic acid/gene; 

(iii) Alternative splice variants of a seedyl nucleic acid/gene; 

(iv) Allelic variants of a seedyl nucleic acid/gene; 

(v) Homologues, derivatives and active fragments of a seedyl protein; 

The term seedyl nucleic acid/gene, as defined herein, also encompasses a complement of 
SEQ ID NO 1 and also to corresponding RNA, DNA, cDNA or genomic DNA. The seedyl may 
be synthesized in whole or in part, it may be double-strand nucleic acid or single-stranded 
nucleic acid. Also this term encompasses a variant of the gene due to the degeneracy of the 
genetic code and variants that are interrupted by one or more intervening sequences. 

An example of a variant seedyl nucleic acid is a functional portion of a seedyl nucleic acid. 
The methods according to the invention may advantageously be practised using functional 
portions of a seedyl. A functional portion refers to a piece of DNA derived or prepared from 

8 
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an original (larger) DNA molecule, which DNA portion, when introduced and expressed in 
plant, gives plants having changed development. The portion may comprise many genes, wil 
or without additional control elements or may contain spacer sequences. The portion may b 
made by making one or more deletions and/or truncations to the nucleic acid. Techniques ft 
introducing truncations and deletions into a nucleic acid are well known in the art. Portion 
suitable for use in the methods according to the invention may readily be determined b 
following the methods described in the Examples section by simply substituting the sequeno 
used in the actual Example with the portion to be tested for functionality. 

An example of a further variant seedyl nucleic acid is a sequence that is capable c 
hybridising to a seedyl nucleic acid, for example to SEQ ID NO 1, 3, 5, 7, 9 or 11 
Advantageously, the methods according to the present invention may also be practised using 
sequences capable of hybridising to a coiled coil protein, particularly a seedyl protein as 
represented by any one of SEQ ID NO: 2, 4, 6, 8, 10 or 12, which hybridising sequences are 
preferably those falling within the definition of a "seedyl" as set out herein before. Hybridising 
sequences suitable for use in the methods according to the invention may readily be 
determined for example by following the methods described in the Examples section by simply 
substituting the sequence used in the actual Example with the hybridising sequence. 

The term "hybridisation" as defined herein is a process wherein substantially homologous 
complementary nucleotide sequences anneal to each other. The hybridisation process can 
occur entirely in solution, i.e. both complementary nucleic acids are in solution. Tools in 
molecular biology relying on such a process include the polymerase chain reaction (PCR; and 
all methods based thereon), subtractive hybridisation, random primer extension, nuclease S1 
mapping, primer extension, reverse transcription, cDNA synthesis, differential display of RNAs, 
and DNA sequence determination. The hybridisation process can also occur with one of the 
complementary nucleic acids immobilised to a matrix such as magnetic beads, Sepharose 
beads or any other resin. Tools in molecular biology relying on such a process include the 
isolation of poly (A+) mRNA The hybridisation process can furthermore occur with one of the 
complementary nucleic acids immobilised to a solid support such as a nitro-cellulose or nylon- 
membrane or immobilised by e.g. photolithography to e.g. a siliceous glass support (the latter 
known as nucleic acid arrays or microarrays or as nucleic acid chips). Tools in molecular 
biology relying on such a process include RNA and DNA gel blot analysis, colony hybridisation, 
plaque hybridisation, in situ hybridisation and microarray hybridisation. In order to allow 
hybridisation to occur, the nucleic acid molecules are generally thermally or chemically 
denatured to melt a double strand into two single strands and/or to remove hairpins or other 
secondary structures from single stranded nucleic acids. The stringency of hybridisation is 

9 



105-seedy1-EP 

influenced by conditions such as temperature, salt concentration and hybridisation buffer 
composition. High stringency conditions for hybridisation include high temperature and/or low 
salt concentration (salts include NaCI and Na 3 -citrate) and/or the inclusion of formamide in the 
hybridisation buffer and/or lowering the concentration of compounds such as SDS (detergent) 
in the hybridisation buffer and/or exclusion of compounds such as dextran sulphate or 
polyethylene glycol (promoting molecular crowding) from the hybridisation buffer. 
Conventional hybridisation conditions are described in, for example, Sambrook (2001) 
Molecular Cloning: a laboratory manual, 3rd Edition Cold Spring Harbor Laboratory Press, 
CSH, New York, but the skilled craftsman will appreciate that numerous different hybridisation 
conditions can be designed in function of the known or the expected homology and/or length of 
the nucleic acid. Sufficiently low stringency hybridisation conditions are particularly preferred 
(at least in the first instance) to isolate nucleic acids heterologous to the DNA sequences of the 
invention defined supra. An example of low stringency conditions is 4-6x SSC / 0.1-0.5% w/v 
SDS at 37-45°C for 2-3 hours. Depending on the source and concentration of the nucleic acid 
involved in the hybridisation, alternative conditions of stringency may be employed, such as 
medium stringency conditions. Examples of medium stringency conditions include 1-4x SSC / 
0.25% w/v SDS at a 45°C for 2-3 hours. An example of high stringency conditions includes 
0.1-1x SSC / 0.1% w/v SDS at 60°C for 1-3 hours. The skilled man will be aware of various 
parameters which may be altered during hybridisation and washing and which will either 
maintain or change the stringency conditions. The stringency conditions may start low and be 
progressively increased until there is provided a hybridising seedyl nucleic acid, as defined 
hereinabove. Elements contributing to heterology include allelism, degeneration of the genetic 
code and differences in preferred codon usage. 

Another example of a variant seedyl is an alternative splice variant of a seedyl . The methods 
according to the present invention may also be practised using an alternative splice variant of 
a seedyl nucleic acid. The term "alternative splice variant" as used herein encompasses 
variants of a nucleic acid in which selected introns and/or exons have been excised, replaced 
or added. Such splice variants may be found in nature or can be manmade using techniques 
well known in the art. Preferably, the splice variant is a splice variant of the sequence 
represented by any of SEQ ID NO 1, 3, 5, 7, 9 or 11. Splice variants suitable for use in the 
methods according to the invention may readily be determined for example by following the 
methods described in the Examples section by simply substituting the sequence used in the 
actual Example with the splice variant. 

Another example of a variant seedyl is an allelic variant. Advantageously, the methods 
according to the present invention may also be practised using allelic variants of a seedyl 

10 



105-seedyi-EP 

nucleic acid, preferably an allelic variant of a sequence represented by any of SEQ ID NO 1 , 3, 
5, 7, 9 or 1 1 . Allelic variants exist in nature and encompassed within the methods of the 
present invention is the use of these isolated natural alleles in the methods according to the 
invention. Allelic variants encompass Single Nucleotide Polymorphisms (SNPs), as well as 
5 Small Insertion/Deletion Polymorphisms (INDELs). The size of INDELs is usually less than 100 
bp). SNPs and INDELs form the largest set of sequence variants in naturally occurring 
polymorphic strains of most organisms. Allelic variants suitable for use in the methods 
according to the invention may readily be determined for example by following the methods 
described in the Examples section by simply substituting the sequence used in the actual 
10 Example with the allelic variant. 

Accordingly, the present invention provides a method for modifying growth characteristics of a 
plant, wherein the seedyl nucleic acid is a splice variant of a seedyl nucleic acid or wherein 
said seedyl protein is encoded by a splice variant or wherein the seedyl nucleic acid is an 
IS allelic variant of a seedyl nucleic acid or wherein said seedyl protein is encoded by an allelic 
variant. 

Examples of variant seedyl amino acids also in elude homologues, derivatives and active 
fragments of a seedyl protein. Advantageously, the methods according to the present 
20 invention may also be practised using homologues, derivatives or active fragments of a coiled 
coil, preferably using homologues, derivatives or active fragments of a seedyl protein, 
preferably a seedyl protein as represented by any one of SEQ ID NO 2, 4, 6, 8, 10 or 12. 

"Homologues" of a seedyl protein encompass peptides, oligopeptides, polypeptides, proteins 
25 and enzymes having amino acid substitutions, deletions and/or insertions relative to the 
unchanged protein in question and having similar biological and functional activity as the 
unchanged protein from which they are derived. To produce such homologues, amino acids of 
the protein may be replaced by other amino acids having similar properties (such as similar 
hydrophobicity, hydrophilicity, antigenicity, propensity to form or break a-helical structures or 0- 
30 sheet structures). Conservative substitution tables are well known in the art (see for example 
Creighton (1984) Proteins. W.H. Freeman and Company). 

The homologues useful in the methods according to the invention have a percentage identity 
to any one of SEQ ID NO 2, 4, 6, 8, 10 or 12 equal to value lying between 20% and 99.99%. 
35 The homologues useful in the method according to the invention have at least 20%, 21%, 22%, 
23%, 24%, 25%, 26%, 27%, 28%, 29%, 30%, 31%, 32%, 33%, 34%, 35%, 36%, 37%, 38%, 
39%, 40%, 41%, 42%, 43%, 44%, 45%, 46%, 47%, 48%, 49%, or 50% sequence identity or 
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similarity (functional identity) to the unchanged protein, alternatively at least 60% sequence 
identity or similarity to an unchanged protein, alternatively at least 70% sequence identity or 
similarity to an unchanged protein. Typically, the homologues have at least 75% or 80% 
sequence identity or similarity to an unchanged protein, preferably at least 85%, 86%, 87%, 
5 88%, 98% sequence identity or similarity, farther preferably at least 90%, 91%, 92%, 93%, 
94% sequence identity or similarity to an unchanged protein, most preferably at least 95%, 
96%, 97%, 98% or 99% sequence identity or similarity to an unchanged protein. 
The percentage of identity can be calculated by using an alignment program well known in the 
art. For example, the percentage of identity can be calculated using the program GAP, or 
10 needle (EMBOSS package) or stretcher (EMBOSS package) or the program align X, as a 
module of the vector NTI suite 5.5 software package, using the standard parameters (for 
example GAP penalty 5, GAP opening penalty 15, GAP extension penalty 6.6). 

The homologues useful in the methods according to the invention are preferably coiled coil 
15 proteins, further preferably seedy 1 proteins as defined herein above. Homologues suitable for 
use in the methods according to the invention may readily be determined for example by 
following the methods described in the Examples section by simply substituting the sequence 
used in the actual Example with the homologous sequence. 

20 Methods for the search and identification of seedyl homologues or DNA sequences encoding 
a seedyl homologue, would be well within the realm of persons skilled in the art. Such 
methods, involve screening sequence databases with the sequences as provided by the 
present invention in SEQ ID NO 1 and 2 or 3 to 10, preferably a computer readable format of 
the nucleic acids of the present invention. This sequence information is available for example 

25 in public databases, that include but are not limited to Genbank 
(http^/www.ncbi.nlm.nih.qov/web/Genbank) . the European Molecular Biology Laboratory 
Nucleic acid Database (EMBL) (http:/w.ebi.ac.uk/ebi-docs/embl-db.html) or versions thereof or 
the MIPS database (http://mips.gsf.de/). Different search algorithms and software for the 
alignment and comparison of sequences are well known in the art. Such software includes 

30 software include GAP, BESTFIT, BLAST, FASTA and TFASTA. GAP uses the algorithm of 
Needleman and Wunsch (J. Mol. Biol. 48: 443-453, 1970) to find the alignment of two 
complete sequences that maximises the number of matches and minimises the number of 
gaps. The BLAST algorithm calculates percentage sequence identity and performs a statistical 
analysis of the similarity between the two sequences. The suite of programs referred to as 

35 BLAST programs has 5 different implementations: three designed for nucleotide sequence 
queries (BLASTN, BLASTX, and TBLASTX) and two designed for protein sequence queries 
(BLASTP and TBLASTN) (Coulson, Trends in Biotechnology: 76-80, 1994; Birren et al., 
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GenomeAnalysis, 1: 543, 1997). The software for performing BLAST analysis is publid 
available through the National Centre for Biotechnology Information. 

Homologues of SEQ ID NO 2 can be found in many different organisms. The closes 
homologues are found in the plant kingdom. For example, seedyl nucleic acids were isolafo 
from tobacco (SEQ ID NO 1 and 2), from rice (SEQ ID NO 3) encoding a rice seedy 
homologue (SEQ ID NO 4), from medicago (SEQ ID NO 5 and 6), from sugar cane (SEQ IC 
NO 7 and 8), from maize (SEQ ID NO 9 and 10) and from Arabidopsis (SEQ ID NO 11 anc 
12). Also ESTs from other organisms were found and deposited in Genbank, for example EST 
from Vitis vinifera (accession number CA816066), from Pinus taeda (accession numbei 
BM903108), from Saccharus sp. (So variant) (accession numbers CA228193 and CA256020) 
from Citrus sinsensis (accession number CF833583), Plumbago zeylanica (accession numbei 
CB817788), from Zea mays (accession number CF637447, AW282224, CD058812, 
AY108162, CD059048, CF041861, AW067243), from Triticum aestivum (CA727065, 
BJ264506, BJ259034), from Hordeum vulgare (accession number BU997034, CA727065, 
CA031127, BQ762011), from Bn (CD817460) from Ga (BG446106, BM360339), from 
Eschscholzia califronica (CD478368), from Pt (BU821376), and from beta vulgaris 
(BQ594009). 

As more genomes are being sequenced, it is expected that many more seedyl homologues 
shall be identifiable. 

The above-mentioned analyses for comparing sequences, for calculation of sequence identity 
and for the search for homologues, is preferentially done with the full-length sequence or with 
a conserved region of the sequence. Therefore, these analyses can be based on a comparison 
of certain regions such as conserved domains, motifs or boxes. 

The identification of such domains or motifs, would also be well within the realm of a person 
skilled in the art and involves for example, a computer readable format of the nucleic acids of 
the present invention, the use of alignment software programs and the use of publicly available 
information on protein domains, conserved motifs and boxes. This protein domain information 

ls available - in - the PRODOM 

(http://www.biochem.ud.ac.uk/bsm/dbbrowser/ij/prodomsrchij.html), pir 
(http://pir.georgetown.edu/) or pFAM (http://pfam.wustJ.edu/) database. Sequence analysis 
programs designed for motif searching can be used for identification of fragments, regions and 
conserved domains as mentioned above. Preferred computer programs would include but are 
not limited to MEME, SIGNALSCAN, and GENESCAN. A MEME algorithm (Version 3.0) can 
be found in the GCG package; or on the Internet site http://www.sdsc.edu/MEME/meme. 
SIGNALSCAN version 4.0 information is available on the Internet site 

13 
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http://biosci.cbs.umn.edu/soflware/sigscan.html. GENESCAN can be found on the Internet site 
http^/gnomicstanford.edu/GENESCANW.html. 

More particularly preferred seedyl homologues have the conserved domains as depicted in 
SEQ ID NO 15, 16 and 17, or motifs that are 80% identical to these motifs. Also preferred 
seedyl homologues have a coiled coil domain, preferably located in the N-terminal half of the 
protein, more preferably between amino acid position 25 to 250, more preferably between 
position 50 and 150. A coiled coil domain is determining the folding of the protein. 

Two special forms of homology, orthologous and paralogous, are evolutionary concepts used 
to describe ancestral relationships of genes. The term "paralogous" relates to gene- 
duplications within the genome of a species. A "paralogue" is therefore duplication. The term 
"orthologous" relates to homologous genes in different organisms due to ancestral relationship 
and the formation of different species. An orthologue is thus a "speciation". The term 
"homologues" as used herein also encompasses paralogues and orthologues and are useful 
proteins in the methods according to the invention. 

Another variant of seedyl useful in the methods of the present invention is a derivative of 
seedyl. The term "derivatives" refers to peptides, oligopeptides, polypeptides, proteins and 
enzymes which may comprise substitutions, deletions or additions of naturally and non- 
naturally occurring amino acid residues compared to the amino acid sequence of a naturally- 
occurring form of the protein, for example, as presented in SEQ ID NO: 2. "Derivatives" of a 
seedyl protein encompass peptides, oligopeptides, polypeptides, proteins and enzymes which 
may comprise naturally occurring changed, glycosylated, acylated or non-naturally occurring 
amino acid residues compared to the amino acid sequence of a naturally-occurring form of the 
polypeptide. A derivative may also comprise one or more non-amino acid substituents 
compared to the amino acid sequence from which it is derived, for example a reporter 
molecule or other ligand, covalently or non-covalently bound to the amino acid sequence such 
as, for example, a reporter molecule which is bound to facilitate its detection, and non-naturally 
occurring amino acid residues relative to the amino acid sequence of a naturally -occurring 
protein. 

"Substitutional variants" of a protein are those in which at least one residue in an amino acid 
sequence has been removed and a different residue inserted in its place. Amino acid 
substitutions are typically of single residues, but may be clustered depending upon functional 
constraints placed upon the polypeptide; insertions will usually be of the order of about 1 to 10 

14 
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amino acid residues, and deletions will range from about 1 to 20 residues. Preferably, amino 
acid substitutions comprise conservative amino acid substitutions. 

"Insertional variants" of a protein are those in which one or more amino acid residues are 
5 introduced into a predetermined site in a protein. Insertions can comprise amino-terminal 
and/or carboxy-terminal fusions as well as intra-sequence insertions of single or multiple amino 
acids. Generally, insertions within the amino acid sequence will be smaller than amino- or 
carboxy-terminal fusions, of the order of about 1 to 10 residues. Examples of amino- or 
carboxy-terminal fusion proteins or peptides include the binding domain or activation domain of 
10 a transcriptional activator as used in the yeast two-hybrid system, phage coat proteins, 
(histidine)6-tag, glutathione S-transferase-tag, protein A, maltose-binding protein, dihydrofolate 
reductase, Tag-100 epitope, c-myc epitope, FLAG®-epitope, lacZ, CMP (calmodulin-binding 
peptide), HA epitope, protein C epitope and VSV epitope. 

15 "Deletion variants" of a protein are characterised by the removal of one or more amino acids 
from the protein. Amino acid variants of a protein may readily be made using peptide synthetic 
techniques well known in the art, such as solid phase peptide synthesis and the like, or by 
recombinant DNA manipulations. Methods for the manipulation of DNA sequences to produce 
substitution, insertion or deletion variants of a protein are well known in the art. For example, 

20 techniques for making substitution mutations at predetermined sites in DNA are well known to 
those skilled in the art and include M13 mutagenesis, T7-Gen in vitro mutagenesis (USB, 
Cleveland, OH), QuickChange Site Directed mutagenesis (Stratagene, San Diego, CA), PCR- 
mediated site-directed mutagenesis or other site-directed mutagenesis protocols. 

25 Another variant of seedyl useful in the methods of the present invention is an active fragment 
of seedyl. "Active fragments" of a seedyl protein encompasses contiguous amino acid 
residues of a seedyl protein, which residues retain similar biological and/or functional activity 
to the naturally occurring protein. For example, useful fragments comprise at least 10 
contiguous amino acid residues of a seedyl protein. Other preferred fragments are fragments 

30 of the seedyl protein starting at the second or third or further internal methionin residues. 
These fragments originate from protein translation, starting at internal ATG codons. 

In the present invention a new category of proteins and encoding genes is disclosed. 
Surprisingly, altering the availability of such sequences is a plant changes the growth 
35 characteristics of the plant. More particularly the seed quantity and quality is improved in these 
plants. Therefore, these sequences useful for the methods of the present invention are named 
"seedy" sequences. 

15 



105-seedyl-EP 

As the type of proteins defined herein as seedyl proteins was not known, the present invention 
provides for an isolated nucleic acid encoding at least part of a seedyl protein, wherein said 
seedyl protein comprises in the following order from N-terminus to C-terminus at least two 
5 motifs, which motifs are: 

(i) a motif having at least 80% sequence identity to the sequence 
represented by SEQ ID NO 15; and/or 

(ii) a motif having at least 80% sequence identity to the sequence 
represented by SEQ ID NO 16, and/or 

10 (iii) a motif having at least 80% sequence identity to the sequence 

represented by SEQ ID NO 17 and which is a coiled coil motif; and/or 
(iv) a motif having at least 80% sequence identity to the sequence 
represented by SEQ ID NO 18. 

15 In the present invention, a new type of gene has been isolated from tobacco which gene 
encodes a new type of protein. Surprisingly it has been found that this gene and/or protein is 
capable of modifying growth characteristics of a plant. Therefore, the present invention 
provides an isolated nucleic acid selected from the group comprising, 

(i) a nucleic acid represented by any of SEQ ID NO: 1, 5 or 7 the complement 
20 strand thereof; 

(ii) a nucleic acid encoding an amino acid sequence represented by SEQ ID NO: 2, 
4, 6, 8 or 10 or a homologue, derivative or active fragment of the above mentioned 
sequences; 

(iii) a nucleic acid capable of hybridising with a nucleic acid of (i) or (ii) above, which 
25 hybridising sequence preferably encodes a protein having seedyl protein activity; 

(iv) a nucleic acid which is degenerate from any one of the nucleic acids of (i) to (iii) 
above as a results of the genetic code; 

(v) a nucleic acid which is an allelic variant of any one of the nucleic acids of (i) to 
(iv); 

30 (vi) a nucleic acid which is an alternative splice variant of any one of the nucleic 

acids of (i) to (v); 

(vii) a nucleic acid encoding a protein which has at least 21%, 22%, 23%, 24%, 
25%, 26%, 27%, 28%, 29%, 30%, 31%, 32%, 33%, 34%, 35%, 36%, 37%, 38%, 
39%, 40%, 41%, 42%, 43%, 44%, 45%, 46%, 47%, 48%, 49%, 50%, 60%, 70%, 
35 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity to any one 

or more from the sequences defined in (i) to (vi), which protein preferably encodes 
a protein having seedyl activity; 

16 
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(viii) a portion of a nucleic acid according to any of (i) to (vii) above, which portio 
preferably encodes a protein having seedyl activity. 

According to another embodiment of the present invention, there is provided an isolate. 
5 nucleic acid as mentioned herein above, wherein said isolated nucleic acid is not a nucleii 
acid selected from the group comprising: the rice cDNA as deposited under Genbanl 
accession number AK063941 (SEQ ID NO 3), a Medicago BAC clone deposited as AC144618 
or AC139356 or AC144482 or AC135566, the Arabidopsis cDNA deposited under AL61575 
(SEQ ID NO 11), or the Zea mays EST deposited under AY108162 (SEQ ID NO 9). Furthei 
10 the present invention described for the first time a novel type of protein and therefore, within 
the scope of the present invention is an isolated seedyl protein comprising in the following 
order from N-terminus to C-terminus at least two motife, which motifs are: 

0) a motif having at least 80% sequence identity to the sequence represented by 
SEQ ID NO 15; and/or 

15 (H) a motif navln 9 at least 80% sequence identity to the sequence represented by 

SEQ ID NO 16, and/or 

(Hi) a motif having at least 80% sequence identity to the sequence represented by 

SEQ ID NO 1 7 and which is a coiled coil motif; and/or 
(iv) a motif having at least 80% sequence identity to the sequence represented by 
20 SEQ ID NO 18. 

Further, the present invention provides, an isolated seedyl protein, comprising 

a. a polypeptide with an amino acid sequence as presented in any one of SEQ ID NO 
2, 4, 6, 8 or 10; 

25 b - a PO'ypeptide with an amino acid sequence which has at least 21%, 22%, 23%, 

24%, 25%, 26%, 27%, 28%, 29%, 30%, 31%, 32%, 33%, 34%, 35%'. 36%! 37%,' 
38%, 39%, 40%, 41%, 42%, 43%, 44%, 45%, 46%, 47%, 48%, 49%, 50%, 60%,' 
70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity to any 
one or more of the amino acid sequence as described in (a) 

30 . c .-^polypeptide which is a homologue, a derivative, an immunologically adivean^ 

functional fragment of a protein as defined in any of (a) or (b). 

According to another embodiment of the present invention, there is provided an isolated 
seedyl protein as mentioned herein above, wherein said protein is not the Arabisopsis protein 
as deposited in Genbank under the accession number AL1 61 572 (SEQ ID NO 12). 
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According to a preferred aspect of the present invention, enhanced or increased expression of 
a seedyl nucleic acid in a plant or plant part Is envisaged. Methods for obtaining increased 
expression of genes or gene products are well documented in the art and include, for example, 
overexpression driven by a (strong) promoter, the use of transcription enhancers or translation 
enhancers. The term overexpression as used herein means any form of expression that is 
additional to the original wild-type expression level. Preferably the nucleic acid to be introduced 
into the plant and/or the nucleic acid that is to be overexpressed in the plants is in the sense 
direction with respect to the promoter to which it is operably linked. Alternatively, selection of 
better performing alleles of the wild-type seedyl nucleic acid can be achieved via plant 
breeding techniques. 

Accordingly, a preferred embodiment of the present invention provides a method to modify 
growthe characteristics in a plant, comprising introducing, into a plant, a nucleic acid capable 
of modifying expression of a seedyl gene and/or capable of modifying activity and/or level of a 
seedyl protein in the sense orientation relative to control element to which it is operably linked. 

Alternatively and/or additionally, increased expression of a seedyl encoding gene or increased 
activities and/or levels of a seedyl protein in a plant cell, is achieved by mutagenesis. For 
example these mutations can be responsible for the changed control of the seedyl gene, 
resulting in more expression of the gene, relative to the wild-type gene. Mutations can also 
cause cx>nformational changes in a protein, resulting in more activity and/or levels of the 
seedyl protein. 

Additionally, these is envisaged by the present invention a method for modifying growth 
characteristics comprising downregulation of expression of a seedyl gene or downregulation 
of levels and/or activity of a seedyl protein. These methods are particularly useful to modify 
the size or quantity or architecture of plants particular plant organs. According to a particular 
embodiment of the present invention, plants are decreased in size or plant organs are 
decreased in size or number, or the plant architecture is changed, for example there is less 
branching or less branching of the inflorescence. Therefore, according to a further aspect of 
the invention, decreased expression of a seedyl nucleic acid or decreased activity and/or level 
of a seedyl is envisaged. 

Examples of decreasing or downregulation of expression are well documented in the art and 
include, for example, downregulation of expression by anti-sense techniques, RNAi 
techniques, small interference RNAs (siRNAs), microRNA (miRMA), etc. Therefore according 
to a particular aspect of the invention, there is provided a method for modifying growth 
characteristics of plants, including technologies that are based on for example the synthesis of 
antisense transcripts, complementary to the mRNA of a seedyl gene. 

18 
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Another method for downregulation of gene expression or gene silencing comprises use o 
ribozymes, for example as described in WO9400012 (Atkins et al.), WO9503404 (Lenee et al ) 
WO0000619 (Nikolau et al.), W09713865 (Ulvskov et al.) and W097381 16 (Scott et al.). 

Gene silencing may also be achieved by insertion mutagenesis (for example, T-DNA insertion 
or transposon insertion) or by gene silencing strategies as described among others in the 
documents WO9836083 (Baulcombe and Angell), WO9853083 (Grierson et al.), W09915682 
(Baulcombe et al.) or WO9953050 (Waterhouse et al.). 

Expression of an endogenous gene may also be reduced if the endogenous gene contains a 
mutation. Such a mutant gene may be isolated and introduced into the same or different plant 
species in order to obtain plants having modified growth characteristics. Also dominant 
negatrve mutants of a seedyl nucleic acid can be introduced in the cell to decrease the 
level/and or activHy of the endogenous Seedyl protein. 

Other methods to decrease the expression of a seedyl nucleic add and/or activity and/or level 
of seedyl proteins in a cell encompass for example the mechanisms of transcriptional gene 
silencing, such as the methylation of the seedyl promoter. 

Another mechanism to downregulate levels and/or activity of a seedyl protein in a plant 
encompasses the mechanism of co-suppression. Modifying gene expression (whether by a 
direct or indirect approach) encompasses changed transcript levels of that gene. Changed 
transcript levels can be sufficient to induce certain phenotypic effects, for example via the 
mechanism of ^suppression. Here the overall effect of expression of a transgene is that there 
is less activity in the cell of the protein encoded by a native gene having homology to the 
introduced transgene. Cosuppression is accomplished by the addition of coding sequences or 
parts thereof in a sense orientation into the cell. Therefore, according to one aspect of the 
present invention, the development of a plant may be changed by introducing into a plant an 
additional copy_(in full or in part) of -a seedyl gene already present in a host plant The 
additional gene may silence the endogenous gene, giving rise to a phenomenon known as co- 
suppression. 

Genetic constructs aimed at silencing gene expression may comprise the seedyl nucleotide 
sequence or one at least a portion thereof in a sense and/or antisense orientation relative to 
the promoter sequence. Preferably the portions comprises at least 21 contiguous nucleic acid 
of a sequence to be downregulated. Also, sense or antisense copies of at least part of the 
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endogenous gene in the form of direct or inverted repeats may be utilised in the methods 
accenting to the invention. The development of plants may also be changed by intreducing ,r*> 
a plant at least part of an antisense version of the nucleotide sequence represented, for 
example by SEQ ID NO: 1. It should be clear that part of the nucleic acid (a portion) could also 
5 achieve 'the desired result. Homologous anturense genes are preferred, homologous genes 
being plant genes, preferably plant genes from the same plant species in which the silencing 
construct is introduced. 

The expression of a seedyl gene can be investigated by Northern or Southern blot analysis of 
10 cell extracts. The levels of seedy! pretein in the cell can be investigated via Western blot 
analysis of cell extracts. 

According to a further embodiment of the present invention, genetic constructs and vectors to 
facilitate introduction and/or expression of the nucleotide sequences useful in the methods 
15 according to the invention are provided. Therefore, according to the further embodiment, the 
present invention provides a genetic construct comprising: 

(i) a at least part of a nucleic acid of the present invention as described above or at 
least a part of a nucleic acid encoding a seedyl protein as defined above; 

(ii) one or more control sequences capable of regulating expression of the nucleic acd 

20 of (i); and optionally 

(Hi) a transcription termination sequence. 

According to methods of the present invention, such a genetic construct is introduced into a 
plant or plant part. 

Constructs useful in the methods according to the present invention may be constructed using 
recombinant DNA technology well known to persons skilled in the art. The gene constructs 
may be inserted into vectors, which may be commercially available, suitable for transferring 
into plants and suitable for expression of the gene of interest in the transformed cells. 
The genetic construct can be an expression vector wherein said nucleic acid is operably l.nked 
to one or more control sequences allowing expression in prokaryotic and/or eukaryotic host 
cells. 

The nucleic acid according to (i) is advantageously any of the aforementioned nucleic acids, 
preferably any of the above mentioned variants of a seedyl nucleic acid, most preferably a 
seedyl nucleic acid, such as for example one of the nucleic acids of SEQ ID NO 1 . 3, 5, 7, 9 or 
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11. The construct sequence of (ii) is preferably a seed-preferred promoter, for example ■ 
prolamin promoter 

According to a preferred embodiment of the invention, the genetic construct is an expressio. 
vector designed to overexpress the nucleic acid. A preferred nucleic acid (i) in thh 
overexpression vector is the sequence represented by SEQ ID NO 1, 3, 5, 7, 9 or 11 or t 
portion thereof or sequences capable of hybridising therewith or a nucleic acid encoding s 
sequence represented by SEQ ID NO 2. 4, 6, 8, 10 or 12 or a homologue, derivative or active 
fragment thereof. Preferably, this nucleic acid is cloned in the sense orientation relative to the 
control sequence. 

The methods according to the present invention may also be practised by introducing into a 
plant at least a part of a (natural or artificial) chromosome (such as a Bacterial Artificial 
Chromosome (BAC)), which chromosome contains at least a seedyl nucleic acid, optionally 
together with one or more related gene family members. Therefore, according to a further 
aspect of the present invention, there is provided a method for modifying growth characteristics 
of a plant by introducing into a plant at least a part of a chromosome comprising at least a 
seedyl gene/nucleic, which seedyl nucleic is preferably one represented by any one of SEQ 
ID NO 1,3, 5, 7, 9 or 11. 



Plants are transformed with a vector comprising the sequence of interest (i.e., the nucleic acid 
capable of modifying expression of seedyl nucleic acid), which sequence is operably linked to 
one or more control sequences (at least a promoter). The terms "regulatory element", "control 
sequence" are all used herein interchangeably and are to be taken in a broad context to refer 
to regulatory nucleic acids capable of effecting expression of the sequences to which they are 
ligated(i.e. operably linked). Encompassed by the aforementioned terms are promoters A 
"Promoter" encompasses transcriptional regulatory sequences derived from a classical 
eukaryotic genomic gene (including the TATA box which is required for accurate transcription 
rotation, with or without a CCAAT box sequence) and additional regulatory elements (i e 
upstream activating sequences, enhancers and silencers) which alter gene expression in 
response to developmental and/or external stimuli, or in a tissue-specific manner Also 
.ncluded within the term is a transcriptional regulatory sequence of a classical prokaryotic 
gene, in which case it may include a -35 box sequence and/or -10 box transcriptional 
regulatory sequences. The term "regulatory element" also encompasses a synthetic fusion 
molecule or derivative which confers, activates or enhances expression of a nucleic acid 
molecule in a cell, tissue or organ. The term "operably linked" as used herein refers to a 
functional linkage between the promoter sequence and the gene of interest, such that the 
promoter sequence is able to initiate transcription of the gene of interest. 
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Advantageously, any type of promoter may be used to drive expression of the nucleic acid 
depending on the desired outcome. For example, a meristem-specrfic promoter, such as the 
mr (ribonucleotide reductase), cdc2a promoter and the cyc07 promoter. Also seed -specific 
5 promoter, such as P 2S2, pPROLAMIN, pOLEOSIN could be selected. An aleurone-specific 
promoter may be selected. An inflorescence-specific promoter, such as pLEAFY, may also be 
utilised. To produce male-sterile plants one would need an anther specific promoter. One could 
also choose a petal-specific promoter. If the desired outcome would be to modify plant growth 
characteristics In particular organs, then the choice of the promoter would depend on the organ 
10 to be changed. For example, use of a root-specific promoter would lead to phenotyp.c 
alteration of the root. This would be particularly important where it is the root itself that is the 
desired end product; such crops include sugar beet, turnip, carrot, and potato. A fruit-specific 
promoter may be used to modify, for example, the strength of the outer skin of the fruit or to 
increase the size of the fruit A green tissue-specific promoter may be used to influence the 
15 phenotype pf the leaf. A cell wall-specific promoter may be used to increase the rigidity of the 
cell wall, thereby increasing resistance of the cell. An anther-specific promoter may be used to 
produce male-sterile plants. A vascular-specific promoter may be used to increase transport 
from leaves to seeds. A nodule-specific promoter may be used to increase the nitrogen fixing 
capabilities of a plant, thereby increasing the nutrient levels in a plant. A stress-inducble 
20 promoter may also be used to drive expression of a nucleic acid during conditions of stress. A 
stress inducible promoter such as the water stress induced promoter WSI18, the drought 
stress induced Trg-31 promoter, the ABA related promoter rab21 or any other promoter wh.ch 
is induced under a particular stress condition such as temperature stress (cold, freezing, heat) 
or osmotic stress, or drought stress or oxidative stress or biotic stress can be used to drive 
25 expression of a seedyl gene. 

Preferably, the nucleic acid capable of modifying expression of a seedyl gene is operably 
linked to a plant promoter, preferably a tissue preferred promoter. The term "tissue-preferred" 
as defined herein refers to a promoter that is expressed predominantly in at least one tissue or 

30 organ. Preferably, the tissue-preferred promoter is a seed-preferred promoter or a seed- 
specific promoter, ferther preferably an endosperm-specific promoter, more preferably a 
promoter isolated from a gene encoding a seed-storage protein most preferably a promoter 
isolated from a prolamin gene, such as for example a rice prolamin promoter as presented .n 
SEQ ID NO 14 or a promoter of similar strength and/or a promoter with a similar expression 

35 pattern as the rice prolamin promoter. Similar strength and/or similar expression pattern can be 
analysed for example by coupling the promoters to a reporter gene and check the funcfon of 
the reporter gene in tissues of the plant. One suitable reporter gene is beta-glucuronidase and 
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the colorlmetric GUS staining to visualize the beta-glucuronidase activity in a plant tissue i 
well known to a person skilled in the art. 

Examples of preferred seed-specific promoters and other tissue-specific promoters are 
presented in Table I, which promoters or derivatives thereof are useful in performing the 
methods of the present invention. Accordingly, genetic constructs comprising at least part of * 
seedyl nucleic acid and at least part of a promoter from table I, preferably, wherein said part 
are operably linked, is also provided by the present invention. 



TABLE I 

EXEMPLAKY SEED-PREFERRED PROMOTERS FOR USE iN IHCTf^R^i/^p O 
THE PRESENT INVENTION 



IGENE SOURCE 


J EXPRESSION PATTERN 


I REFERENCE 


1 seed-specific genes 


seed " ' 


Simon, et a/.. Plant MoL Biol. 5: 191 
1985; Scofield, et al J Biol Chan- 
262: 12202, 1987.; Baszczynski, et al. 
Plant MoL Biol. 14:633, 1990. . 


I Brazil Nut albumin 


seed 


Pearson, et al., Plant Mol. Biol. 18: 235 
245, 1992. 


legumin 


seed 


Elhs, et al., Plant Mol. Biol. 10: 203-214 
1988. 


glutelin (nee) 


seed 


Takaiwa, et al., Mol. Gen. Genet. 208: 
15-22, 1986; Takaiwa, et al., FEBS 
Letts. 221: 43-47. 1987. 


zein 


seed 


Matzke et al Plant Mol Biol, 14(3):323- 
32 1990 


napA ——————— 

wheat LMW and HMW 
glutenin-1 


seed 

endosperm ] 
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Optionally, one or more terminator sequences may also be used in the construct introduced 
into a plant. The term "terminator" encompasses a control sequence which is a DNA sequence 
at the end of a transcriptional unit which signals 3* processing and polyadenylation of a primary 
5 transcript and termination of transcription. Additional regulatory elements may include 
transcriptional as well as translational enhancers. Those skilled in the art will be aware of 
terminator and enhancer sequences, which may be suitable for use in performing the 
invention. Such sequences would be known or may readily be obtained by a person skilled in 
the art. 

10 

The genetic constructs of the invention may further include an origin of replication sequence 
which is required for maintenance and/or replication in a specific cell type. One example is 
when a genetic construct is required to be maintained in a bacterial cell as an episomal genetic 
element (e.g. plasmid or cosmid molecule). Preferred origins of replication include, but are not 
15 limited to, the fl-ori and colE1. 

The genetic construct may optionally comprise a selectable marker gene. As used herein, the 
term "selectable marker gene" includes any gene, which confers a phenotype on a cell in 
which it is expressed to facilitate the identification and/or selection of cells which are 
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transfected or transformed with a nucleic acid construct of the invention. Suitable markers ma 
be selected from markers that confer antibiotic or herbicide resistance, that introduce a nev 
metabolic trait or that allow visual selection. Examples of selectable marker genes include 
genes conferring resistance to antibiotics (such as nptll encoding neomycir 
phosphotransferase capable of phosphorylating neomycin and kanamycin, or hpt encodinj 
hygromycin phosphotransferase capable of phosphorylating hygromycin), to herbicides (foi 
example bar which provides resistance to Basta; aroA or gox providing resistance against 
glyphosate), or genes that provide a metabolic trait (such as manA that allows plants to use 
mannose as sole carbon source). Visual marker genes result in the formation of colour (foi 
example beta-glucuronidase, GUS), luminescence (such as luciferase) or fluorescence (Green 
Fluorescent Protein, GFP, and derivatives thereof). Further examples of suitable selectable 
marker genes include the ampicillin resistance (Ampr), tetracycline resistance gene (Tcr), 
bacterial kanamycin resistance gene (Kanr), phosphinothricin resistance gene, and the 
chloramphenicol acetyltransferase (CAT) gene, amongst others 

In a preferred embodiment, the genetic construct as mentioned above, comprises at least part 
of a rice prolamin promoter operably linked to a seedyl nucleic acid in the sense orientation. 
An example of such an expression cassette, further comprising a terminator sequence s 
presented in SEQ ID NO 13. 

Therefore the present invention provides an isolated genetic construct, comprising a nucleic 
acid selected from the group comprising: 

a. A nucleic acid having at least a part of a seed-preferred promoter and at least a part 
of the nucleic acid as defined herein above 2; and 

b. A nucleic acid as presented in SEQ ID NO 13, or the complementary strand thereof; 

c. a nucleic acid which is degenerated as a result of the genetic code from any of the 
nucleic acids of (a) or (b); 

d. a nucleic acids which is an allelic variant of any of the nucleic acids of (a) or (b); 

e. a nucleic acid which hybridizes to any of the nucleic acids of (a) or (b). 

According to a further embodiment of the present invention, there is provided a method for the 
production of a plant having modified growth characteristics, comprising modifying expression 
and or activity and/or levels in a plant of a seedyl nucleic acid or seedyl protein. ' 
According to a particular embodiment, the present invention provides a method for the 
production of a transgenic plant having modified growth characteristics, which method 
comprises: 
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(i) introducing into a plant or plant part a nucleic acid capable of modifying 
expression of a seedyl gene and/or capable of modifying the activity and/or 
levels of a seedyl protein; 

(ii) cultivating the plant cell under conditions promoting regeneration and mature 

5 plant growth. 

The nucleic acid of (i) may advantageously be any of the aforementioned nucleic acids, 
preferably a nucleic acid encoding a coiled coil protein, more preferably encoding a seedyl 
most preferably a seedyl nucleic acid according to SEQ ID NO 1 , 3, 5, 7, 9, or 1 1 . 
Preferably in the above mentioned method, the nucleic acid of (i) is overexpressed in the 

10 plant, is in the sense direction and/or is driven by a seed-preferred promoter, such as the 
proiamin promoter. 

The protein Itself and/or the nucleic acid itself may be introduced directly into a plant cell or into 
the plant itself (including introduction into a tissue, organ or any other part of the plant). 
15 According to a preferred feature of the present invention, the nucleic acid is preferably 
introduced into a plant by transformation. 

The term -transformation- as referred to herein encompasses the transfer of an exogenous 
polynucleotide into a host cell, irrespective of the method used for transfer. Plant tissue 
20 capable of subsequent clonal propagation, whether by organogenesis or embryogenesis, may 
be transformed with a genetic construct of the present invention and a whole plant regenerated 
therefrom. The particular tissue chosen will vary depending on the clonal propagation systems 
available for, and best suited to, the particular species being transformed. Exemplary tissue 
targets include leaf disks, pollen, embryos, cotyledons, hypocotyls, megagametophytes, callus 
25 tissue, existing meristematic tissue (e.g., apical meristem, axillary buds, and root meristems), 
and induced meristem tissue (e.g. cotyledon meristem and hypocotyl meristem). The 
polynucleotide may be transiently or stably introduced into a host cell and may be maintained 
non-integrated, for example, as a plasmid. Alternatively and preferably, the transgene may be 
stably integrated into the host genome. The resulting transformed plant cell can then be used 
30 to regenerate a transformed plant in a manner known to persons skilled in the art. 

Transformation of a plant species is now a fairly routine technique. Advantageously, any of 
several transformation methods may be used to introduce the gene of interest into a suitable 
ancestor cell. Transformation methods include the use of liposomes, electroporation, 
35 chemicals that increase free DMA uptake, injection of the DNA directly into the plant, particle 
gun bombardment, transformation using viruses or pollen and microprojection. Methods may 
be selected from the calcium/polyethylene glycol method for protoplasts (Krens, F.A. et al., 
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1882, Nature 296, 72-74; Negrutiu I. et al., June 1987, Plant Mol. Biol. 8, 363-373 
electroporation of protoplasts (Shillito R.D. et al., 1985 Bio/Technol 3, 1099-1102 
microinjection into plant material (Crossway A. et al., 1986, Mol. Gen Genet 202, 179-185 
DNA or RNA-coated particle bombardment (Klein T.M. et al., 1987, Nature 327, 70) infectioi 
with (non-integrative) viruses and the like. 

Transgenic rice plants expressing a seedyl gene are preferably produced via Agmbacterium 
mediated transformation using any of the well known methods for rice transformation, such ai 
described in any of the following: published European patent application EP 1198985 A1 
Aldemita and Hodges (Planta, 199, 612-617, 1996); Chan et al. (Plant Mol. Biol. 22 (3) 491- 
506, 1993), Hiei et al. (Plant J. 6 (2) 271-282. 1994), which disclosures are incorporated by 
reference herein as if fully set forth. In the case of corn transformation, the preferred method is 
as described in either Ishida et al. (Nat. Biotechnol. 1996 Jun; 14(6): 745-50) or Frame et al 
(Plant Physiol. 2002 May; 129(1): 13-22), which disclosures are incorporated by reference 
herein as if fully set forth. 

Generally after transformation, plant cells or cell groupings are selected for the presence of 
one or more markers which are encoded by plant-expressible genes co-transferred with the 
gene of interest, following which the transformed material is regenerated into a whole plant 

Following DNA transfer and regeneration, putatively transformed plants may be evaluated for 
instance using Southern analysis, for the presence of the gene of interest, copy number and/or 
genomic organisation. Alternatively or additionally, expression levels of the newly introduced 
DNA may be monitored using Northern and/or Western analysis, both techniques being well 
known to persons having ordinary skill in the art. 

The generated transformed plants may be propagated by a variety of means, such as by clonal 
propagation or classical breeding techniques. For example, a first generation (or T1) 
transformed plant may be selfed to give homozygous second generation (or T2) transformants, 
and the T2 plants further propagated through classical breeding techniques. 

The generated transformed organisms may take a variety of forms. For example, they may be 
chimeras of transformed cells and non-transformed cells; clonal transformants (e.g., all cells 
transformed to contain the expression cassette); grafts of transformed and untransformed 
tissues (e.g., in plants, a transformed rootstock grafted to an untransformed scion). 

The present invention also encompasses plants obtainable by the methods according to the 
present invention. The present invention therefore provides plants obtainable by the method 
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according to the present invention, which plants have modified growth characterises, when 
compared to the wild-type plants. 

The present invention clearly extends to any plant cell or plant produced by any of the methods 
5 described herein, and to all plant parts and propagules thereof. The present invention extends 
further to encompass the progeny of a primary transformed or transfected cell, tissue, organ or 
whole plant that has been produced by any of the aforementioned methods, the only 
requirement being that progeny exhibit the same genotypic and/or phenotypic characterises) 
as those produced in the parent by the methods according to the invention i.e. having modified 
10 growth characteristics. The invention accordingly also includes host cells having modrfied 
expression of a nucleic acid according to the invention and described herein before and/or 
having modified activity and/or level of a seedyl protein or a protein according to the present 
invention, wherein said expression, level or activity is modrfied relative to corresponding w.ld 
type host cells. Preferred host cells according to the invention are plant cells or cells from 
15 insects, animals, yeast, fungi, algae or bacteria. The invention also extends to harvestabie 
parts of a plant such as but not limited to seeds, flowers, stamen, leaves, petals, frurts, stem, 
stem cultures, rhizomes, roots, tubers and bulbs. 

Preferably said host cells is transformed with a seedyl encoding gene or a genetic construct 
20 as described above wherein the seedyl nucleic acid is under the control of a seed-preferred 
promoter and more preferably the plants of the present invention carry an expression cassette 
comprising at least a part of seedyl and at least a part of a constitutive promoter as mentioned 
hereinabove. The host cells, plants or the plant parts of the present invention can be idenffied 
by the presence of higher expression of a seedyl gene and/or or a higher level and/or activrty 
25 of a seedyl protein. Further, particular plants of the present invention are recognizable by the 
presence of a seedyl transgene. 

The term "plant" as used herein encompasses whole plants, ancestors and progeny of the 
plants and plant parts, including seeds, shoots, stems, roots (including tubers), and plant cells, 

30 tissues and organs. The term "plant" also therefore encompasses suspension cultures, 
embryos, meristematic regions, callus tissue, leaves, gametophytes, sporophytes, pollen, and 
microspores. Plants that are particularly useful in the methods of the invention include all 
plants which belong to the superfamily Viridiplantae, in particular monocotyledonous and 
dicotyledonous plants including a fodder or forage legume, ornamental plant, food crop. tree. 

35 or shrub selected from the list comprising Acacia spp., Acer spp., Actinidia spp., Aesculus 
spp Agathis australis, Albizia amara, Alsophila tricolor, Andropogon spp., Arachis spp, Araca 
catechu, Astelia fragmns, Astragalus clear, Baikiaea plurijuga, Betula spp., Brassica spp., 
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Bruguiera gymnorrhiza, Burkea africana, Butea frondosa, Cadaba farinosa, Calliandra spf 
Camellia sinensis, Canna indica, Capsicum spp., Cassia spp., Centroema pubescent 
Chaenomeles spp., Cinnamomum cassia, Coffea arabica, Colophospermum mopam 
Coronillia varia, Cotoneaster serotina, Crataegus spp., Cucumis spp., Cupressus spp 
5 Cyathea dealbata, Cydonia oblonga, Cryptomeria japonica, Cymbopogon spp., Cynthe 
dealbata, Cydonia oblonga, Dalbergia monetaria, Davallia divaricate, Desmodium spp 
Dicksonia squarosa, Diheteropogon amplectens, Dioclea spp, Dolichos spp., Dorycniui 
rectum, Echinochloa pyramidalis, Ehrartia spp., Eleusine coracana, Eragrestis spp., Erythrin 
spp., Eucalyptus spp., Euclea schimperi, Eulalia villosa, Fagopyrum spp., Feijoa sellowiane 

10 Fragaria spp., Flemingia spp, Freycinetia banksii, Geranium thunbergii, Ginkgo biloba, Glycin 
javanica, Gliricidia spp, Gossypium hirsutum, Grevillea spp., Guibourtia coleosperme 
Hedysarum spp., Hemarthia altissima, Heteropogon contortus, Hordeum vulgare, Hyparrheni 
rufa, Hypericum erectum, Hyperthelia dissolute, Indigo incarnate, Iris spp., Leptarrhen 
pyrottfolia, Lespediza spp., Lettuca spp., Leucaena leucocephala, Loudetia simplex, Lotonu 

15 bainesii, Lotus spp., Macrotyloma axillare, Malus spp., Manihot esculenta, Medicago sative 
Metesequoia glyptostroboides, Musa sapientum, Nicotianum spp., Onobrychis spp 
Omithopus spp., Oryza spp., Peltophorum africanum, Pennisetum spp., Persea gratissimz 
Petunia spp., Phaseolus spp., Phoenix canariensis, Phormium cookianum, Photinia spp 
Picea glauca, Pinus spp., Pisum sativum, Podocarpus totera, Pogonarthria flecki 

20 Pogonarthria squarrosa, Populus spp., Prosopis cineraria, Pseudotsuga menziesi 
Pterolobium stellatum, Pyrus communis, Quercus spp., Rhaphiolepsis umbellate 
Rhopalostylis sapida, Rhus natalensis, Ribes grossularia, Ribes spp., Robinia pseudoacacia 
Rosa spp., Rubus spp., Salix spp., Schyzachyrium sanguineum, Sciadopitys verticillate 
Sequoia sempervirens, Sequoiadendron giganteum, Sorghum bicolor, Spinacia spp. 

25 Sporobolus fimbriatus, Stiburus alopecuroides, Stylosanthos humilis, Tadehagi spp, Taxodiur, 
distichum, Themeda triandra, Trifolium spp., Trib'cum spp., Tsuga heterophylla, Vacciniur, 
spp., Vicia spp., Vitis vinrfera, Watsonia pyramidata, Zantedeschia aethiopica, Zea mayi 
amaranth, artichoke, asparagus, broccoli, Brussels sprouts, cabbage, canola, carrol 
cauliflower, celery, collard greens, flax, kale, lentil, oilseed rape, okra, onion, potato, rice 

30 soybean, straw, sugar beet, sugar cane, sunflower, tomato, squash tea, trees. Alternative!] 
algae and other non-Viridiplantae can be used for the methods of the present inventior 
Preferably the plant according to the present invention is a crop plant selected from rice 
maize, wheat, barley, millet, oats, rye, soybean, sunflower, canola, sugarcane, alfalfe 
leguminosae (bean, pea), flax , lupinus, rapeseed, tobacco, tomato, potato, squash, papaye 

35 poplar and cotton. Further preferably, the plant according to the present invention is ; 
monocotyledonous plant, most preferably a cereal. 
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Advantageously, the present invention provides a method for modifying growth characteristics 
of a plant According to further embodiments, the invention provides methods for modifying 
plant growth characteristics, wherein said modified growth characteristics are selected from 
any one or more of increased yield, increased biomass, modified plant architecture. 

Further preferabiy, increased yield is increased seed yield en encompasses increased number 
of filled seeds and/or increased total seed weight per plant. 

The term "increased yield" encompasses an increase in biomass in one or more harvestabie 
parts of a plant relative to the total biomass of corresponding wild-type plante. The term also 
encompasses an increase in seed yield, which includes an increase in the biomass of the seed 
(seed weight) and/or an increase in the number of (filled) seeds and/or in the size of the seeds 
and/or an increase in seed volume, each relative to corresponding wild-type plants. An 
increase in seed sfce and/or volume may also influence the composition of seeds. An .ncrease 
in seed yield could be due to an increase in the number and/or size of flowers. An increase ,n 
yield might also increase the harvest index, which is expressed as a ratio of the total biomass 
over the yield of harvestabie parts, such as seeds. 

The methods of the present invention are used to increase the seed yield of the plant and are 
therefore particularly favourable to be applied to crop plants, preferably seed crops and 
cereals Therefore, the methods of the present invention are particularly useful for plants such 
as, rapeseed, sunflower, teguminosae (e.g. soybean, pea, bean) flax, lupinus, canola and 
cereals such as rice, maize, wheat, barley, millet, oats and rye. 

Further preferably, increased biomass encompasses increased biomass of above-ground plant 
tissue, herein determined as above-ground plant area. 

Additionally or alternatively, the plants according to the invention have more above ground 
area Therefore, the methods of the present invention are additionally and/or alternatively 
particularly favourable to crops grown for the green tissue and/or grown for the above ground 
biomass. The methods of the present invention are particularly usefa. for increasing leaf s,ze 
and number of grasses and forage crops (such as forage maize, clover, medicago alfalfa etc.) 
The methods of the present invention are also particularly useful for increasing the stem s.ze of 
trees (for paper and pulp industry) and sugar cane. 

Further preferably, said modified plant architecture encompasses increased number of 

; ZlTthods of the present invention clearly change the appearance or morphology of a plant 
including any one or more structural features or combination of stroctura. features thereof. 
Therefore the plants according to the present invention have changed architecture when 
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compared to the wild-type plants. Other structural features, which may be altered by tt 
methods of the present invention include shape, size, number, position, texture, arrangemer 
and pattern of any cell, tissue or organ or groups of ceils, tissues or organs of a plant, includir 
the root, leaf, shoot, stem or tiller, petiole, trichome, flower, inflorescence (f 
5 monocotyledonous and dicotyledonous plants), panicles, petal, stigma, style, stamen, polle 
ovule, seed, embryo, endosperm, seed coat, aleurone, fibre, cambium, wood, heartwoo 
parenchyma, aerenchyma, sieve elements, phloem or vascular tissue, amongst others. 

The present invention also relates to use of a seedyl nucleic acid and and/or protein i 
10 modifying growth characteristics and to compositions therefore. According to a particul; 
embodiment, the methods of the present invention are used to change growth characteristic 
of a plant, wherein said growth characteristics is not disease or pathogen resistance, or is m 
stress tolerance. 

IS The present invention also relates to use of a seedyl nucleic acid and and/or protein as 
growth regulator, such as a herbicide or growth stimulator and to compositions therefore. 
The present invention also relates to use of a seedyl nucleic acid and and/or protein as 
target for an agrochemical compound, such as a herbicide or growth stimulator and 1 
compositions therefore. 

20 

According to another aspect of the present invention, advantage may be taken of tti 
nucleotide sequence capable of modifying expression of a seedyl nucleic acid in breedin 
programmes. The nucleic acid may be on a chromosome, or a part thereof, comprising at lea 
the seedyl nucleic acid and preferably also one or more related family members. In a 

25 example of such a breeding programme, a DNA marker is identified which may be genetical 
linked to a gene capable of modifying expression of a seedyl nucleic acid in a plant, whic 
gene may be a gene encoding the seedyl protein itself or any other gene which may direct 
or indirectly influence expression of the seedyl gene and/or activity of the seedyl protein itsel 
This DNA marker may then be used in breeding programs to select plants having change 

30 development . — 

Further the use of allelic variants as described herein-above are particular ly useful fc 
conventional breeding programmes, such as in marker-assisted breeding, which is als 
encompassed by the present invention. Such breeding programmes sometimes require th 
35 introduction of allelic variations in the plants by mutagenic treatment of a plant. One suitabl 
mutagenic method is EMS mutagenesis. Identification of allelic variants then may take plac 
by, for example, PCR. Tilling is preferred for identifying allelic variants. This is followed by 
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seiection step for selection of superior allelic variants of the seedyl sequence and which give 
rise to changed development in a plant. Selection, according to the method of the present 
invention, is typically carried out by monitoring development, differentiation and organ 
formation of plants containing different allelic variants of the seedyl sequence, for example, 
5 different allelic variants of SEQ ID NO: 1 or of a seedyl orthologue in that plant. Monitoring 
growth performance can be done in a greenhouse or in the field. Further optional steps include 
crossing plants, in which the superior allelic variant was identified, with another plant. This 
could be used, for example, to make a combination of interesting phenotypic features. 
Therefore, mutations in the seedyl gene may occur naturally, and may form the basis of the 
10 selection of plants showing accelerated rate of development, increased organ size and/or 
number, and/or early flowering. 

Accordingly, as another aspect of the invention, there is provided a method for the selection of 
plants having modified growth characteristics, which method is based on the selection of 
better-performing allelic variants of the seedyl sequence relative to the wild-type allele, and 
15 which give rise to modified growth characterstics in a plant. 

According to a related embodiment of the invention, there is provided a method for breeding 
plants having modified growth characteristics, comprising the steps of: 

a. Growing a plant with modified expression of a nucleic acid of the present 
20 invention as described herein above; and 

b. Crossing the plant of (a) with another plant; and 

c. selecting progeny of the cross of (b) having modified growth characteristic; 
wherein said growth characteristics are modified relative to the corresponding wild-type 
plants. 

25 The cross of step be is for example a cross with a commercial important germplasm. 

Accordingly, the present invention also relates to the use of a seedyl gene in breeding 
programmes. 

Alternatively, the seedyl gene itself can be used as a (genetic) marker to detect the presence 
30 or absence of a desired trait, or Quantitative Trait Locus (QTLs). In this application of the 
present invention the gene encoding seedyl is genetically linked to the desired trait, and 
typically the phenotypes caused by the gene encoding a seedyl are monitored in order to 
breed and select plants with the desired trait. This desired trait or QTL, may comprise a single 
gene or a cluster of linked genes that affect the desired trait. 

35 

In molecular biology it is standard practice to select upon transfection or transformation those 
individuals (or groups of individuals, such as bacterial or yeast colonies or phage plaques oi 
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eukaryotic cell clones) that are effectively transfected or transformed with the desired genetii 
construct. Typically these selection procedures are based on the presence of a selectable o 
screenable marker in the transfected genetic construct, to distinguish the positive individuals 
easily from the negative individuals. Therefore, the seedyl gene can also be used for these 
5 purposes, since introduction of this gene into a host cell results in changed development c 
said host cell. 

The methods according to the present invention may also be practised by co-expression of '< 
seedyl gene in a plant with at least one other gene that cooperates with the seedyl gene. Co 
10 expression may be effected by cloning the genes under the control of a plant expressibli 
promoter in a plant expressible vector and introducing the expression vector(s) into a plant eel 
using Agrobacterium-mediated plant transformation. 

The methods according to the present invention result in plants having modified growtf 
15 characteristics, as described hereinbefore. These advantageous characteristics may also be 
combined with other economically advantageous traits, such as further yield-enhancing traits 
tolerance to various stresses, traits modifying various architectural features and/or biochemica 
and/or physiological features. Accordingly, the methods of the present invention can also be 
used in so-called "gene stacking" procedures. 

20 

Also the present invention encompasses a food product derived from any of the plants 
produced by the methods of the present invention. Further the invention also refers to the use 
of a product derived from any of the plants according to the present invention in animal feec 
and in food or in the production procedures thereof. 

25 

In a particular embodiment of the invention the plants with improved growth characteristics 
resulting in improved yield and/or biomass, are used to produce industrial enzymes and/ oi 
pharmaceuticals. The production of such enzymes or pharmaceuticals in plants is aimed a 
high accumulation of the desired products in a particular and easy to harvest plant tissues, foi 

30 example accumulation in the leaves and/or in the seeds. The plants of the present inventior 
have more seeds and more above ground biomass, and therefore are capable of producing 
more industrial enzymes and/or pharmaceuticals in these tissues, more particularly in theii 
green biomass and/or in their seeds. Accordingly, the present invention also provides s 
method for the production of enzymes and/or pharmaceuticals, which method comprises the 

35 modifying of expression of a seedyl gene or the modifying of activity and/or level of a seedyl 
protein. Further the invention relates to the use of plants according to the invention for the 
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production of industrial enzymes and pharmaceuticals and the invention extends to the 
industrial enzymes and pharmaceuticals produced according to these methods. 

Description of the Figures 

5 The present invention will now be described with reference to the following figures in which: 

Figure 1 is a schematic presentation of the entry clone, containing CDS0689 within the AttL1 
and AttL2 sites for Gateway® cloning in the pDONR201 backbone. CDS0689 is the internal 
code for the Nicotians tabacum BY2 CDS0689 seedyl coding sequence. This vector contains 
10 also a bacterial kanamycine-resistance cassette and a bacterial origin of replication. 

Figure 2 is a map of the binary vector for the expression in Oryza sativa of the Nicotiana 
tabacum BY2 seedyl gene (CDS0689) under the control of the rice prolamin RP6 promoter 
(PRO0090). This vector contains a T-DNA derived from the Ti Plasmid, limited by a left border 

15 (LB repeat, LB Ti C58) and a right border (RB repeat, RB Ti C58)). From the left border to the 
right border, this T-DNA contains: a selectable marker cassette for antibiotic selection of 
transformed plants; a screenable marker cassette for visual screening of transformed plants; 
the PRO0090 - CDS0689 -zein and rbcS-deltaGA double terminator cassette for expression of 
the Nicotiana tabacum BY2 seedyl gene (CDS0689) . This vector also contains an origin of 

20 replication from pBR322 for bacterial replication and a selectable marker (Spe/SmeR) for 
bacterial selection with spectinomycin and streptomycin. 

Figure 3 is the alignment of seedyl nucleic acids and ESTs from different plant species. This 
alignment was made with the program align X of the NVTi software package. The motifs 1, 2, 3 
25 and 4 are indicated with a line. 

Figure 4 is the representation of nucleic acids, protein and motif sequences according to the 
invention. 

30 Examples 

The present invention will now be described with reference to the following examples, which 
are by way of illustration alone. 

Unless otherwise stated, recombinant DNA techniques were performed according to standard 
35 protocols described in Sambrook (2001) Molecular Cloning: a laboratory manual, 3rd Edition 
Cold Spring Harbor Laboratory Press, CSH, New York; or in Volumes 1 and 2 of Ausubel etal. 

36 



105-seedy1-EP 

(1988), Current Protocols in Molecular Biology, Current Protocols. Standard materials ar 
methods for plant molecular work are described in Plant Molecular Biology Labfase (1993) i 
R.D.D. Cray, published by BIOS Scientific Publications Ltd (UK) and Blackwell Scientii 
Publications (UK). 

5 

Example 1: cloning of the seedy 1 encoding gene 

A cDNA-AFLP experiment was performed on a synchronized tobacco BY2 cell cultu 
(Nicotiana tabacum L. cv. Bright Yellow-2), and BY2 expressed sequence tags that were a 
cycle modulated were identified and elected for further cloning. Subsequently, the express* 
10 sequence tags were used to screen a tobacco cDNA library and to isolate the fulHength cDN 
of interest, namely the cDNA coding for the seedyl protein of the present inventic 
(CDS0689). 

Synchronization of BY2 cells. 

IS Tobacco BY2 ( Nicotiana tabacum L. cv. Bright Yellow - 2) cultured cell suspension W£ 
synchronized by blocking cells in early S-phase with aphidicolin as follows. Cultured oc 
suspension of Nicotiana tabacum L. cv. Bright Yellow 2 were maintained as described (Naga 
et al. Int. Rev. Cytol. 132, 1-30, 1992). For synchronization, a 7-day-old stationary culture w€ 
diluted 10-fold in fresh medium supplemented with aphidicolin (Sigma-Aldrich, St. Louis, MO; 

20 mg/l), a DNA-polymerase a inhibiting drug. After 24 h, cells were released from the block t 
several washings with fresh medium and resumed their cell cycle progression. 

RNA extraction and cDNA synthesis. 

Total RNA was prepared by using LiCI precipitation (Sambrook et al, 2001) and poly(A+) RN 
25 was extracted from 500 mg of total RNA using Oligotex columns (Qiagen, Hilden, German; 
according to the manufacturer's instructions. Starting from 1 mg of poly(A+) RNA, first-strar 
cDNA was synthesized by reverse transcription with a biotinylated oligo-dT25 primer (Gense 
Paris, France) and Superscript II (Life Technologies, Gaithersburg, MD). Second-strar 
synthesis was done by strand displacement with Escherichia coli ligase (Life Technologies 
30 and DNA polymerase I (USB, Cleveland, OH) and RNAse-H (USB). 

cDNA-AFLP analysis. 

Five hundred ng of double-stranded cDNA was used for AFLP analysis as described (Vos < 
ah, Nucleic Acids Res. 23 (21) 4407-4414, 1995; Bachem et al., Plant J. 9 (5) 745-53, 1996 
35 The restriction enzymes used were BstYl and Msel (Biolabs) and the digestion was done i 
two separate steps. After the first restriction digest with one of the enzymes, the 3' en 
fragments were collected on Dyna beads (Dynal, Oslo, Norway) by means of their biotinylate 
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tail, while the other fragments were washed away. After digestion with the second enzyme, the 
released restriction fragments were collected and used as templates in the subsequent AFLP 
steps. For preamplifications, an Msel primer without selective nucleotides was combined with a 
BstYl primer containing either a T or a C as 3' most nucleotide. PCR conditions were as 
5 described (Vos et al., 1995). The obtained amplification mixtures were diluted 600-fold and 5 
ml was used for selective amplifications using a P33-fabeled BstYl primer and the Amplitaq- 
Gold polymerase (Roche Diagnostics, Brussels, Belgium). Amplification products were 
separated on 5% polyacrylamide gels using the Sequigel system (Biorad). Dried gels were 
exposed to Kodak Biomax films as well as scanned in a phospholmager (Amersham 
10 Pharmacia Biotech, Little Chalfont, UK). 

Characterization of AFLP fragments. 

Bands corresponding to differentially expressed transcripts, among which the (partial) 
transcript corresponding to CDS0689, were isolated from the gel and eluted DNA was 

15 reamplified under the same conditions as for selective amplification. Sequence information 
was obtained either by direct sequencing of the reamplified polymerase chain reaction product 
with the selective BstYl primer or after cloning the fragments in pGEM-T easy (Promega, 
Madison, Wl) or sequencing of individual clones. The obtained sequences were compared 
against nucleotide and protein sequences present in the publicly available databases by 

20 BLAST sequence alignments (Altschul et al., Nucleic Acids Res. 25 (17) 3389-3402 1997). 
When available, tag sequences were replaced with longer EST or isolated cDNA sequences to 
increase the chance of finding significant homology. The physical cDNA clone corresponding 
to CDS0689 was subsequently amplified from a commercial Tobacco cDNA library as follows. 

25 Cloning of a tobacco CDS0689 seedyl gene (CDS0689) 

A c-DNA library with average inserts of 1,400 bp was made with poly(A+) isolated from actively 
dividing, non-synchronized BY2 tobacco cells. These library-inserts were cloned in the vector 
pCMVSPORT6.0, comprising an attB gateway cassette (Life Technologies). From this library 
46,000 clones were selected, arrayed in 384-well microtiter plates, and subsequently spotted 

30 in duplicate on nylon filters. The arrayed clones were screened by using pools of several 
hundreds of radioactively labelled tags as probe (among which the BY2-tag corresponding to 
the sequence CDS0689). Positive clones were isolated (among which the clone reacting with 
the BY2-tag corresponding to the sequence CDS0689), sequenced, and aligned with the tag 
sequence. Alternatively, when the hybridization with the tag would fail, the full-length cDNA 

35 corresponding to the tag was selected by PCR amplification as follows. Tag-specific primers 
was designed using primer3 program (http://www- 

genome.wi.mitedu/genome_software/other/primer3.html) and used in combination with the 
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common vector primer to amplify partial cDNA inserts. Pools of DNA from 50.000, 100.00 
150.000, and 300.000 cDNA clones were used as templates in the PGR amplification 
Amplification product were isolated from agarose gels, cloned, sequenced and aligned wi 
tags. The vector comprising the sequence CDS0689 and obtained as described above, we 
5 referred to as entry clone. 

Example 2: Vector construction for transformation with PRO0090-CDS06& 
cassette 

The entry clone was subsequently used in an LR reaction with p0830, a destination vecti 
10 used for Oryza sativa transformation. This vector contains as functional elements within the " 
DNA borders: a plant selectable marker; a plant screenable marker; and a Gateway casseti 
intended for LR in vivo recombination with the sequence of interest already cloned in the enti 
clone. The rice prolamin RP6 promoter for endosperm-specific expression (PRO0090) 
located upstream of this Gateway cassette. 

15 

After the LR recombination step, the resulting expression vector as shown in Fig. 2 ran b 
transformed into Agrobacterium and subsequently into Oryza sativa plants. Transformed ric 
plants were allowed to grow and then examined for various parameters as described i 
example 3. 

20 

Example 3: Evaluation of TO, T1 and T2 transgenic rice plants transformed wit 
prolamin:: seedy 1 (PRO0090-CDS0689) and results 

Approximately 15 to 20 independent TO rice transformants were generated. The primar 
transformants were transferred from tissue culture chambers to a greenhouse for growing an 
25 harvest of T1 seed. 4 events, of which the T1 progeny segregated 3:1 for presence/absence < 
the transgene, were retained. For each of these events, approximately 10 T1 seedling 
containing the transgene (hetero- and homo-zygotes), and approximately 10 T1 seedling 
lacking the transgene (nullizygotes), were selected by monitoring screenable marks 
expression. 

30 2 events with improved agronomical parameters in T1 were chosen for re-evaluation in T2 
Seed batches from the positive plants (both hetero- and homozygotes) in T1, were screene< 
by monitoring marker expression. For each chosen event, the heterozygote seed batches wer 
then selected for T2 evaluation. An equal number of positives and negatives within each seei 
batch were transplanted for evaluation in the greenhouse. The total number of 120 seedy 

35 transformed plants were evaluated in the T2 generation . More particularly, two seedy 
transformed events have been selected, 60 plants per event of which 30 positives for th< 
transgene, and 30 negative. 
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T1 and T2 plants were transferred to the greenhouse and evaluated for vegetative growth 
parameters and seed parameters, as described hereunder. 

Table 1: overview of plants involved in T1 and T2 plant evaluation 
Number and types Number of plants per Number of positive s Number of negatives 
of lines per evaluation per line per line 

evaluation 
4 lines of T1 plants 
2 lines of T2 plants 
1 line of T1 plants 
1 line of T2 plants 

5 

Statistical analysis: t-test and F-test 

A two factor ANOVA (analysis of variants) was used as statistical model for the overall 
evaluation of plant phenotypic characteristics. An F-test was carried out on all the parameters 
measured, for all of the plants of all of the events transformed with the gene of interest. The 
10 F-test was carried out to check for an effect of the gene over all the transformation events and 
to determine the overall effect of the gene or "global gene effect 0 . Significant data, as 
determined by the value of the F-test, indicates a "gene" effect, meaning that the phenotype 
observed is caused by more than the presence or position of the gene. In the case of the F- 
test, the threshold for significance for a global gene effect is set at a 5% probability level. 

15 

To check for an effect of the gene within an event, i.e., for a line-specific effect, a t-test was 
performed within each event using data sets from the transgenic plants and the corresponding 
null plants. "Null plants" or "Null segregants" are the plants treated in the same way as the 
transgenic plant, but from which the transgene has segregated. Null plants can also be 

20 described as the homozygous negative transformants. The threshold for significance for the 
t-test is set at 10% probability level. Within one population of transformation events, some 
events can be under or above this t-test threshold. This is based on the hypothesis that a 
gene might only have an effect in certain positions in the genome, and that the occurrence ol 
this position-dependent effect is not uncommon. This kind of gene effect may also be referred 

25 to as a "line effect of a gene". The p-value is obtained by comparing the t-value to th€ 
t-distribution or alternatively, by comparing the F-value to the F-distribution. The p-valuc 
stands for the probability that the null hypothesis (null hypothesis being there is no effect o 
the transgene") is correct 



80 
120 
20 
60 



10 
30 
10 
30 



10 
30 
10 
30 



40 



105-seedyl-EP 

Vegetative growth measurements 

The selected transgenic plants were grown in a greenhouse. Each plant received a uniqu 
barcode label to link unambiguously the phenotyping data to the corresponding plant. Th 
selected transgenic plants were grown on soil in 10 cm diameter pots under the followin 
5 environmental settings: photoperiod= 11.5 h, daylight intensity^ 30,000 lux or more, daytim< 
temperature= 28°C or higher, night time temperature^ 22°C, relative humidity= 60-70% 
Transgenic plants and the corresponding nullizygotes were grown side-by-side at randor 
positions. From the stage of sowing until the stage of maturity each plant was passed severs 
times through a digital imaging cabinet and imaged. At each time point digital image 
10 (2048x1536 pixels, 16 million colours) were taken of each plant from at least 6 different angles 
The parameters described below were derived in an automated way from all the digital image 
of all the plants, using image analysis software. 

(a) Aboveground plant area 

15 Plant aboveground area was determined by counting the total number of pixels iron 
aboveground plant parts discriminated from the background. This value was averaged for th* 
pictures taken on the same time point from the different angles and was converted to < 
physical surface value expressed in square mm by calibration. Experiments show that th 
aboveground plant area measured this way correlates with the biomass of plant parts abov< 

20 ground. 

b) Number of primary panicles 

The tallest panicle and all the panicles that overlap with the tallest panicles when alignet 
vertically were counted manually, and considered as primary panicles. 

25 

Seed-related parameter measurements 

The mature primary panicles of T1 and T2 plants were harvested, bagged, barcode-labelle< 
and then dried for three days in the oven at 37°C. The panicles were then threshed and all the 
seeds were collected and counted. The filled husks were separated from the empty one: 
30 using an air-blowing device. The empty husks were discarded and the remaining fraction wai 
counted again. The filled husks were weighed on an analytical balance. This procedun 
resulted in the set of seed-related parameters described below. 

(c) Number of filled seeds 
35 The number of filled seeds was determined by counting the number of filled husks tha 
remained after the separation step. 
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(d) Total seed yield per plant 

The total seed yield was measured by weighing all filled husks harvested from a plant. 

The results are given in % of difference between the positive plants and the corresponding 
5 nullizygotes (negative) plants of a transgenic line. The values given in table 2 to 4 are the 
average of 4 T1 lines and the average for T2 lines. 

Table 2: overview of phenotypic data of seedyl transgenic T1 and T2 plants for above 
ground area 

% difference between pos. and neg. plants for above ground area 
_ p|ante T2 plants 

4 lines +12.5% 

2 lines + 25.5 % 



10 



Table 3: overview of phenotypic data of seedyl transgenic T1 and T2 plants for number 
of first panicles 

% difference between pos. and neg. plants for nr. of first panicles 
T1 pj ants T2 plants ~™ 

4 lines + 32.25 % 

2 lines + 26.5 % 

Table 4: overview of phenotypic data of seedyl transgenic T1 and T2 plants for number 
15 of filled seeds 

% difference between pos. and neg. plants for nr. of filled seeds 
TTpiante T2 plants 

4 lines + 59 % 

2 lines + 36.5 % 

Table 5: overview of phenotypic data of seedyl transgenic T1 and T2 plants for total 
seed weight per plant 

% difference between pos. and neg. plants for total seed weight per plant 
_ p|ante T2 plants ~~™ ~ ~~ ~ ™ -— — — 

4 lines +70% 

2 lines + 47 % 
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Example 4: Use of the invention in corn 

The invention described herein is also be used in maize. To this aim, a seedyl encoding gen< 
for example a maize or other ortholog, is cloned under control of a seed-specific promote 
5 operable in maize, in a plant transformation vector suitable for Agrobacterium-mediated cor 
transformation. Methods to use for com transformation have been described in literatur 
(Ishida et al., Nat Biotechnol. 1996 Jun;14(6):745-50; Frame et al., Plant Physiol. 200 
May;129(1):l3-22). 

10 Transgenic plants made by these methods are grown in the greenhouse for T1 see 
production. Inheritability and copy number of the transgene are checked by quantitative rea 
time PCR and Southern blot analysis and expression levels of the transgene are determine 
by reverse PCR and Northern analysis. Transgenic lines with single copy insertions of th 
transgene and with varying levels of transgene expression are selected for T2 see 

15 production. 

Progeny seeds are germinated and grown in the greenhouse in conditions well adapted fc 
maize (16:8 photoperiod, 26-28°C daytime temperature and 22-24°C nighttime temperature 
as well under water-deficient, nitrogen-deficient, and excess NaCI conditions. Null segregant 

20 from the same parental line, as well as wild type plants of the same cultivar are used a 
controls. The progeny plants resulting from the selling or the crosses are evaluated on differer 
biomass and developmental parameters, including, stem size, number of leaves, total abov 
ground area, leaf greenness, time to maturity, flowering time, time to flower, ear numbei 
harvesting time. The seeds of these lines are also checked on various parameters, such a: 

25 grain size, total grain yield per plant, and grain quality (starch content, protein content and o 
content). 

Lines that are most significantly improved compared to corresponding control lines an 
selected for further field-testing and marker-assisted breeding, with the objective of transferrin! 
30 the field-validated transgenic traits into commercial germplasm. The testing of maize fo 
growth and yield-related parameters in the field is conducted using well-established protocols 
Similarly, introgressing specific loci (such as transgene containing loci) from one germplasn 
into another is also conducted using well-established protocols. 
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Claims 



1. An isolated nucleic acid encoding at least part of a seedy 1 protein, wherein sa 
seedy 1 protein comprises in the following order from N-terminus to C-terminus at least tvt 
5 motifs selected from: 

(i) a motif having at least 80% sequence identity to the sequence represented by SE 
ID NO 15; and/or 

(ii) a motif having at least 80% sequence identity to the sequence represented by SE 
ID NO 16, and/or 

10 (iii) a motif having at least 80% sequence identity to the sequence represented by SE 
ID NO 17, a coiled coil motif; and/or 
(iv) a motif having at least 80% sequence identity to the sequence represented by SE 
ID NO 18. 



15 2. An isolated nucleic acid selected from: 

(i) a nucleic acid represented by any of SEQ ID NO: 1 , 5, 9, or the complement stran 
thereof; 

(ii) a nucleic acid encoding an amino acid sequence represented by SEQ ID NO: 2, - 
6, 8 or 10 or a homologue, derivative or active fragment of any of th 

20 aforementioned sequences; 

(iii) a nucleic acid capable of hybridising with a nucleic acid of (i) or (ii) above, whic 
hybridising sequence preferably encodes a protein having seedyl protein activity; 

(iv) a nucleic acid which is degenerate from any one of the nucleic acids of (i) to (ii 
above as a results of the genetic code; 

25 (v) a nucleic acid which is an allelic variant of any one of the nucleic acids of (i) to (iv); 

(vi) a nucleic acid which is an alternative splice variant of any one of the nucleic acid 
of(i)to(v); 

(vii) a nucleic acid encoding a protein which has at least 21%, 22%, 23%, 24%, 25°/ 
26%, 27%, 28%, 29%, 30%, 31%, 32%, 33%, 34%, 35%, 36%, 37%, 38%, 39°/ 

30 ....... 40%; 41%, 42%, 43%; 44%, 45%, 46%; 47%, 48%, 49%, 50%, 60%, 70%, 75°/ 

80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity to any one c 
more of the sequences defined in (i) to (vi), which protein preferably encodes 
protein having seedyl activity; 

(viii) a portion of a nucleic acid according to any of (i) to (vii) above, which portio 
35 preferably encodes a protein having seedyl activity. 
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3. An isolated seedyl protein comprising in the following order from N-terminus to C- 
terminus at least two motifs selected from: 

(i) a motif having at least 80% sequence identity to the sequence represented by SEQ 
ID NO 15; and/or 

(ii) a motif having at least 80% sequence identity to the sequence represented by SEQ 
ID NO 16, and/or 

(iii) a motif having at least 80% sequence identity to the sequence represented by SEQ 
ID NO 17, a coiled coil motif; and/or 

(iv) a motif having at least 80% sequence identity to the sequence represented by SEQ 
ID NO 18. 

4. An isolated seedyl protein, comprising 

a. a polypeptide with an amino acid sequence as presented in any one of SEQ ID NO 
2, 4, 6, 8 or 10; 

b. a polypeptide with an amino acid sequence which has at least 20%, 21%, 22%, 
23%, 24%, 25%, 26%, 27%, 28%, 29%, 30%, 31%, 32%, 33%, 34%, 35%, 36%, 
37%, 38%, 39%, 40%, 41%, 42%, 43%, 44%, 45%, 46%, 47%, 48%, 49%, 50%, 
60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity 
to any one or more of the amino acid sequence as described in (a) 

c. a polypeptide which is a homologue, derivative, immunologically active and/or 
functional fragment of a protein as defined in any of (a) or (b). 

5. An isolated genetic construct comprising: 

(i) a nucleic acid of claim 1 or 2 or a nucleic acid encoding a protein according to 
claim 3 or 4; 

(ii) one or more control sequence capable of regulating expression of the nucleic 
acid of (i); and optionally 

(iii) a transcription termination sequence. 

6. An isolated genetic construct according to claim 5, wherein said nucleic acid is 
represented by SEQ ID NO 1 or a portion thereof or by sequences capable of hybridising 
therewith, which nucleic acid is preferably from a dicotyledonous plant, further preferably 
from the family Solanaceae, more preferably the nucleic acid is from Nicotians, or is a 
nucleic acid encoding an amino acid sequence represented by SEQ ID NO: 2 or a 
homologue thereof having at least, 21%, 22%, 23%, 24%, 25%, 26%, 27%, 28%, 29%, 
30%, 31%, 32%, 33%, 34%, 35%, 36%, 37%, 38%, 39%, 40%, 41%, 42%, 43%, 44% 
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45%, 46%, 47%, 48%, 49%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98< 
or 99% sequence identity with SEQ ID NO 2, or derivative or active fragment thereof. 

7. An isolated genetic construct according to claim 5 or 6, wherein said control sequenc 
5 is a plant promoter, preferably a seed-specific promoter, more preferably an endospernr 
specific promoter, more preferably a promoter isolated from a gene encoding a seec 
storage protein, most preferably a promoter isolated from a prolamin gene, such as fc 
example the rice prolamin promoter of SEQ ID NO 14. 

An isolated genetic construct according to claim 7, comprising an nucleic acid selecte 
from: 

a. A nucleic acid having at least a part of a seed-preferred promoter and the nuclei 
acid as defined in claims 1 or 2; and 

b. A nucleic acid as presented in SEQ ID NO 13, or the complementary strand thereo 

c. a nucleic acid which is degenerated as a result of the genetic code from any of th 
nucleic acids of (a) or (b); 

d. a nucleic acids which is an allelic variant of any of the nucleic acids of (a) or (b); 

e. a nucleic acid which hybridises to any of the nucleic acids of (a) or (b). 

20 9. Method for modifying growth characteristics of a plant, comprising modifying in a plar 
expression of a nucleic acid according to claim 1 or 2 and/or modifying in a plant lev< 
and/or activity of a seedyl protein according to claim 3 or 4, wherein said growti 
characteristics are modified relative to corresponding wild-type plants. 

25 10. Method according to claim 9, wherein said modification is effected by recombinar 
means and/or chemical means. 

11. Method according to claim 9 or 10, comprising introducing, into a plant, a nucleic aci< 
capable of modifying expression of a gene encoding a seedyl protein and/or capable c 

30 modifying levels and/or activity of a seedyl protein. 

12. Method according to claim 11, wherein said nucleic acid is a nucleic acid of claim 1 c 
2. 

35 13. Method according to claim 12, wherein said nucleic acid is as represented by SEQ II 
NO 1 or a portion thereof or sequences capable of hybridising therewith, which nucleic aci 
is preferably from a dicotyledonous plant, further preferably from the family Sotenaceae 
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more preferably the nucleic acid is from Nicotiana, or is a nucleic acid encoding an amino 
acid sequence represented by SEQ ID NO 2 or a homologue thereof having at least, 20%, 
21%, 22%, 23%, 24%, 25%, 26%, 27%, 28%, 29%, 30%, 31%, 32%, 33%, 34%, 35%, 
36%, 37%, 38%, 39%, 40%, 41%, 42%, 43%, 44%, 45%, 46%, 47%, 48%, 49%, 50%, 
5 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity with 

SEQ ID NO 2, or derivative or active fragment thereof. 

14. Method according to any of claims 9 to 1 3, wherein said nucleic acid is an allelic variant 
of a nucleic acid encoding the seedyl protein of claim 2 or 3, or wherein said seedyl 

10 protein is encoded by an allelic variant. 

15. Method according to any of claims 9 to 13, wherein said nucleic acid is an alternative 
splice variant of a nucleic acid encoding the seedyl protein of claim 2 or 3, or wherein said 
seedyl protein is encoded by a splice variant. 

15 

16. Method according to any claims 9 to 13, wherein said nucleic acid is comprised on at 
least a part of an artificial chromosome, which artificial chromosome preferably also 
comprises one or more related gene family members. 

20 17. A method for the production of a transgenic plant having modified growth 
characteristics, which method comprises: 

(i) Introducing into a plant cell a nucleic acid of claim 1 or 2; 

(ii) cultivating the plant cell under conditions promoting regeneration and mature 
plant growth. 

25 

18. Method according to daim 17, wherein said nucleic acid is overexpressed in a plant. 

19. Method according to any of claims 17 or 18, wherein said nucleic acid is introduced in 
the sense direction into a plant. 

30 

20. Method according to any of claims 17 to 19, wherein expression of said nucleic acid is 
driven by a seed-preferred promoter. 

21 . Method according to any of claims 17 to 20, wherein expression of said nucleic acid is 
35 driven by a promoter of a seed-storage protein. 

22. Method according to claim 21 , wherein the promoter is a prolamin promoter. 
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23. Method for breeding plants having modified growth characteristics, comprising th 
steps of: 

(i) Growing a plant with modified expression of a nucleic acid of claim 1 or 2; and 
S (ii) Crossing the plant of (i) with another plant; and 

(iii) selecting progeny of the cross of (ii) having modified growth characteristics; 
wherein said growth characteristics are modified relative to the corresponding wild-typ 
plants. 

10 24. Method according to any of claims 9 to 23, wherein said modified growth characteristic 
is selected from any one or more of increased yield, increased biomass, modified plar 
architecture, each relative to corresponding wild type plants. 

25. Plants obtainable by a method according to any of claims 9 to 24, which plants hav 
15 modified growth characteristics. 

26. A host cell having modified expression of a nucleic acid of claim 1 or 2 and/or havin 
modified activity and/or level of a protein of claim 2 or 4, wherein said expression, level c 
activity is modified relative to corresponding wild type host cells. 

20 

27. A host cell according to claim 26, comprising a nucleic acid of claim 1 or 2 or and/c 
containing a genetic construct of any one of claims 5 to 8. 

28. A host cell according to claims 26 or 27, which is a cell from a plant, insect, animal 
25 yeast, fungus, algae or bacterium. 

29. A plant having modified growth characteristics, comprising the host cell of claim 28. 

30. A plant according to claim 25 or 29, wherein said plant is a crop plant selected fron 
30 rice, maize, wheat, barley, millet, oats, rye, soybean, sunflower, cariola, sugarcane, alfalfc 

leguminosae (bean, pea), flax , lupinus, rapeseed, tobacco, tomato, potato, squash 
papaya, poplar and cotton, preferably said plant is a monocotyledonous plant, mon 
preferably a cereal. 

35 31. Plant part, preferably a harvestable plant part, a propagule or progeny from a plan 
according to claim 25, 29 or 30. 



48 



105-seedy1-EP 

32. Use of a nucleic acid of claim 1 or 2 and/or of a seedyl protein of claim 3 or 4 in 
modifying the growth characteristics of a plant 

33. Use of a nucleic acid of claim 1 or 2 or of a seedyl protein of claim 3 or 4 as 

(i) a growth regulator, such as a herbicide or a growth stimulator; or 

(ii) as a target for an agrochemical compound, such as a herbicide or a growth 
stimulator; or 

(iii) use of a nucleic acid of claim 1 or 2 in breeding programs; or 

(iv) use of a plant according to any one of claims 25, 29 or 30, or use of a product 
derived therefrom in animal feed or in a food product. 

(v) Use of plants according to any one of claims 21, 29 or 30 for the production of 
(industrial) enzymes, pharmaceuticals or plant effective agents. 

34. (Industrial) enzymes, pharmaceuticals and plant effective agents, produced in a plant 
according to any one of claims 25, 29 or 30. 

35. A food product derived from a plant according to any one of claims 21 f 29 or 30. 

36. A growth regulating composition, such as a herbicide or a growth stimulator, comprising 
a nucleic acid of claim 1 or 2 and/or comprising a seedyl protein of claim 3 or 4, said 
composition further comprising substances normally added to growth regulating 
compositions. 



49 



105-seedy1-EP 

Abstract 

Plants having modified growth characteristics and a method for making 

the same 

The present invention concerns a method for modifying growth characteristics of a plant, in 
5 particular increasing biomass and seed yield and modifying plant architecture, by modifying 
expression of a seedyl nucleic acid and/or modifying levels and/or activity in a plant of a 
seedyl protein. The invention also relates to isolated nucleic acids encoding a seedyl protein, 
which seedyl protein has a coiled coil domain as represented in SEQ ID NO 17 and three 
conserved motifs as presented in SEQ ID NO 15, 16 and 18. Further the invention relates to 
10 transgenic plants having modified growth characteristics, which plants have modified 
expression of a nucleic acid encoding a seedyl protein or modified levels and/or activities of a 
seedyl protein. 
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CDS0689 Os 
CDS0689 Ta variant 
CDS0689 So 
CDS0689 Zm variantrev 
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i Motif 1 40 

( 1 } MSpLQYPEGI DPAgjJ<^J^Ai^GgSEDLS^a^ 

(1) MTs||bATETLNAP P^^^^S^gAFgDGgSQl^S^pEA 

( 1 ) MNNTNNNNILLHS TQV<^K»AEgGE§FAJ^JSgSDS 

(1) MEEDPlgPLVlSN^ABpDSSCSI^ARgPQ 

{ ! ) MMEEDPI^PLViS^^AE^SSSSSSAWHAHA 

(D MEEDPl||PLVlMN^AE&!iSSSAWI^HSP 

(1) MEEDPI^LVHMSgAClNiAlSSiVCHAH 

(1) MEEDPlfPLVtl&HPEj^YSAWI^SP 

(1) MTSTEHTETLNAE^<^l»AEfcc^Nl||sS B A 

(i) mnevlh]^ 

(1) I E I PEVQ I WNNAAFD GDS S AI 

41 Motif 2 80 

(37) — SWSPLKPLSVRPSI^FESDLSS^^^LFENSSVNLS 

< 38 ) SSWSHLN ESFDSfCS - A SVSSSIiQSS 

(32 > SP AVAAVR3<g^^&^fewD 

(32) SPAPi^AI^^I^SLI0BPDV 

(37) SWSN^5 S ^^P^^p5^ SVSSSLKSS 

(35) ASSSPLKPIVI^QSEE^II^^r^pQTipCCISPVRTK 

< 37 ) SWCNFNS GSVNOSl^^gS^gQS&WIKSPVSFK 

( 3 e ) SWWNQSI^^^sSjLSlVCEQSS PVFV 

(41) — F SAPNLSC^LLSDSSI^pLSisLAEMPHPAK 

(37) SWANLKS VYMNQSI^PCS^BLSgRLNKSPTSSL 

(41) S ESDG KEN P S . 

81 120 
(75) S PL P I KPLNPNGALENSRLKPNKPNSKQS LDEMAARKSGK 
(69) VSITEAPS AKSKTVKTKSAADRSKK- 

(49) IVPS SNK— 

(53) VAAG 

(50) 

(54) 



(55) 

(54) 

(67) VSFSTDDP IRCGKVK EKPH 

(75) SPLPIKPLHPNG 

(72) STASWKP LSSKNVTGNTREPFSAKMKSGVCKE 

(66) NSSKPAKPLQ 

(73) SPMQK 

(72) KSCVPNKPLQVN SSVKNSQMKQLK — SVSK 

<81) , 01 Motif 3: coiled coil lfi0 



(115) GN DFRDEKK|| DEE I EE I QME I S RL S S RL EALR§|EKAEKT V 
(94) RcSdAE IEE^EKE I GRLSSLESLRgEKAEOTA 

(56) ro^ddeiaeieseikrl^srlelli^ekaerki 

( 57 ) if d$||ae i ghx eae i lrl ss rlhhlr||skq pe pn 

(50) dadaeiarieae ilrlssrlhhlrgskghdak 

(54 ) d|||aeighieae ilrlxsrlhhlrt skqseps 

<55) 3 ee #^ ie ^ i3 ^ slrlhhlrtsqqlqpp 

(54 ) dMaeighieaeilrlssrlhhlrtskqsepp 

(86) KTGKVRHGD§DAEIEE^EKE§NRLS irleslb||ekaeqi a 

(87) 

(105) EEKKRDEKKfDMEIEEIEKI^ARLS^ESLRiEKPNIMQ 

(76) 

(78) 

(100) eeetrderk^dieieeiekeisrlssrleaw^kidikt 
(121) id ei ie ei rlssrl lrl k 
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CDS0689 (339) 

CDS0689 At (294) 

CDS0689 Medicago trunculata (284) 

CDS0689 Os (297) 

CDS0689 Hv contig 123 (111) 

CDS0689 Ta contig (1) 

CDS0689 Zm partial (87) 

CDS0689 Sacc sp 3' (84) 

CDS0689 Pinus taeda 3* (1) 

Consensus (401) 

CDS0689 (379) 

CDS0689 At (327) 

CDS0689 Medicago trunculata (297) 

CDS0689 Os (337) 

CDS0689 Hv contig 123 (150) 

CDS0689 Ta contig (21) 

CDS0689 Zm partial (127) 

CDS0689 Sacc sp 3' (124) 

CDS0689 Pinus taeda 3' (1) 

Consensus (441) 



401 440 

3S 

-RASDESN 

6GD — ARKRSFSEN N 

AAAAATAKRMAGSSKK^VIPSRy&lTPGiSL^SSGAQERR 
ISTASTCiRPAGSSKVgWPSRYSpMP^ST^i-AATQDGR 
GRYs[SmPG||s L||AASQERRR 

tsnvattkrpagssk^wpsry^|pp^si.|avtqgnrc 
tsnaatakrpagssk\7gwpsry^tp(^yl|avsqdkrs 

KR R RYSL PGA LG 

441 480 

KNL C^SRVKKlWEIPiEIVVHGNTESEKSPL^IVK 

K &GRVKKgWEIP§EVDLYSSGEN — GDe||pIVK 

KGL G§EIRAKKgWEIPIEEVlvSG 

CQSLP&^SGDANQ&EII^KVlipSlJ-'-DPIiSE 
!SLPGlTGSTGQfBEIK^VPTj|pVD — DDLSE 
KESLPGS§GGAGQKBEEIKpyiPTlPVD--DDLSP 

KQS LPGSATETP^NLTEPPNDE LSPEE 

KQS-LPGPASAASQfEElcRlKLTlPSK— DELSPEf 

XEARlfFGTGNSAIMAGGTKgE 

K GS R EI A E LSPESL KV 




CDS0689 
CDS0689 At 
CDS0689 Medicago trunculata 
CDS0689 Os 
CDS0689 Hv contig 123 
CDS0689 Ta contig 
CDS0689 Zm partial 
CDS0689 Sacc sp 3' 
CDS0689 Pinus taeda 3» 
Consensus 

CDS0689 
CDS0689 At 
CDS0689 Medicago trunculata 
CDS0689 Os 
CDS0689 Hv contig 123 
CDS0689 Ta contig 
CDS0689 Zm partial 
CDS0689 Sacc sp 3 9 
CDS0689 Pinus taeda 3' 
Consensus 

CDS0689 
CDS0689 At 
CDS0689 Medicago trunculata 
CDS0689 Os 
CDS0689 Hv contig 123 
CDS0689. Ta contig 
CDS0689 Zm partial 
CDS068S Sacc sp 3 * - 
"CDS0689 Pinus taeda 3» 





Motif 4 

520 

ISFFSgDE 

[RRVGGSP 
[STMRFVDESPj 
ITMPPPDES: 
TMPRPNE||P| 
TMPPPDEfgPi 
TMPPSDESPj 

:pasdese^ 

'TES 

481) AELLPRIRTMP DES PRDS G CAKRVADL VGKRS FF AAA 
521 560 

456) DKEPPVCQVLSFAEEfjAEEE 

389) DRNFTFCQ§LKFEE 

361) EEERVMVEEEGGSVCQVLWFAEE^)DDDDYGEQG 

228) AGD GSAISSYQjlfv^^ 

(99) AGD CSAISSYq|||Vi|^ 

198) DDG NLVTPYCS^W 3 ^^!^^ 11517 ^ 

201) EDG N FVT P YQ||p V(|f L 

(69) ITST — PSIDXXDA^PItRRE^VHEIXHAXXX 

521) ARVLE EA E 

561 592 

476) 

395) 

432) 

265) ALSDEAAAAAAAAEALSDEAAAAEALSDEAAA 

136) GE ALGDEAAA • 

226) ~ — 

217) 

Consensus (561) 
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SEQ ID NO 1: Nicotlana tabacvm seedyl coding sequence 
(CDS0689) 

atgagtgtgttacaatacccagaagggattgacccagcagatgttcagatatggaacaatgc 
agcatttgataatggagattctgaagatttgtcttcgctgaaacgttcttggtctcctctga 
aacccctttcggttaggccatcagattcctttgaatctgatttgtcaagtaaggaaaatcaa 
actcctttatttgagaattcatctgttaatctctcatctccgttacccataaagccacttaa 
ccctaatggggctctggaaaattcaagactcaagccgaacaagcccaattccaaacagagtc 
ttgatgagatggcggctagaaagagcggaaagggaaatgatttccgtgatgagaagaaaata 
gacgaggaaattgaagaaattcagatggagattagtaggttgagttcaagattagaggcttt 
gagaattgaaaaggctgagaaaactgttgctaagactgttgaaaagcgaggaagggttgtgg 
cagcaaagtttatggagccaaaacaaagtgttattaagattgaagagcgtatatcaatgagt 
gcaagaacaaaggtggagcagagaaggggtcttagtttaggaccatctgagatttttactgg 
aacgcggcggcgagggttgagtatggggccatcagatattctagcagggacaacaaaggcac 
ggcaattgggaaagcaagagatgattattactcctattcagccaatacaaaacaggcgaaag 
tcgtgtttttggaagcttcaagagattgaagaagagggaaaaagttcaagccttagtcctaa 
atcaagaaaaactgctgcaagaacaatggttacaacaaggcaggcagttactacaattgcat 
caaagaagaatttgaaaaaagatgatggacttttgagttcagttcagccaaagaagttgttt 
aaagatctcgaaaagtctgctgctgctaataagaagccccagaggccggggagggttgtggc 
tagtaggtataatcagagtacaattcagtcatcagtagtgagaaagaggtctttacctgaaa 
atgataaggatgagagtaagagaaatgataagaaacggtcgttatctgtagggaaaacgcgt 
gtgtctcaaactgagagcaagaatttgggtactgaaagtagggtgaaaaagagatgggaaat: 
tcctagtgagattgtagttcatggaaacacagagagtgagaaatctccactaagcattattg 
tgaagcctgatttgcttccgcgaattaggattgctcggtgtgtgaatgagactcttagggat 
tctggacctgctaaaagaatgatagagttgataggcaagaaatcgtttttcagtagtgatga 
agataaggagccacctgtctgtcaagttttaagttttgcagaggaagatgctgaagaggaat 
aa 



SEQ ID NO 2: Nlcotlana tabacum seedyl protein (CDS0689) 

MSVLQYPEGIDPADVQIWNNAAFDNGDSEDLSSLKRSWSPLKPLSVRPSDSFESDLSSKENQ 
TPLFENSSVNLSSPLPIKPLNPNGALENSRLKPNKPNSKQSLDEMAARKSGKGNDFRDEKKI 
DEEIEEIQMEISRLSSRLEALRIEKAEKTVAKTVEK^^ 

ARTKVEQRRGLSLGPSEIFTGTRRRGLSMGPSDILAGTTKARQLGKQEMIITPIQPIQNRRK 
SCFWKLQEIEEEGKSSSLSPKSRKTAARTMVTTRQAVTTIASKKNLKKDDGLLSSVQPKKLF 
KDLEKSAAANKKPQRPGRWASRYNQSTIQSSVVRKRSLPENDKDESKRNDKKRSLSVGKTR 
VSQTESKNLGTESRVKKRWEIPSEIWHGNTESEKSPLSIIVKPDLLPRIRIARCVNETLRD 
SGPAKRMIELIGKKSFFSSDEDKEPPVCQVLSFAEEDAEEE 
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SEQ ID NO 3: Oryza satlva seedy 1 coding sequence 

atggaggaggacccgctcatcccgctggtccacgtctggaacaacgccgccttcgacgactc 
ctcgtgttccagatcggcttggctcccccaaagccccgccgtcgcggccgtccgcaagggcg 
acaaggagaatcaccgccccgaggttgttgatgtcgccgccggctacgacgtcgaggccgag 
atcggccacatcgaggcggagatcctgcgcctctcgtcccggctccaccatctccgcgtctc 
caagcagccggagcccaaccgcgacgacgctccgatgggggagatggtcgcgaaggtgaggc 
cccggccgaggggcctcagcctcgggcccctggatgtgatctccatcgtcaatcgtgagaag 
catccgctgcgcaccaagcagcctccggcgacgcggggcagggggctcagcctcgggcccat: 
ggagatcgccgcggcgaaccctagggtgcccgcggcggcgcagcatcagcaacagcaacgcg 
ctggcacggcgcggatcctgaagccaatcaaggagcctccggtgcagcgtcgcaggggcgtc 
agcctcgggccgttggagatccaccacggcgtcggcagcaaggcaccagcggcggcgcgagc 
caagccgttcaccaccaagctcaacgccattcgagaagaaacccgaccctccaagcaattcg 
ccgtccccgccaagccatggccgtcgagcaatacaaggcagacactggactcgaggcaagga 
acagcagcaagtcgagcgaaggcgaggagcccgagccccaggcccaggaggcaatccaatgg 
caaggctactgacacaaggggaggcaacaaggtggtggatgagctcaagcccaaaggtgcgt 
cgtcaagtcagagcggcagcgccgccgccgccgccactgccaagaggatggcggggagctcc 
aagatgagggtcatcccgagccgctacagcctcactcctggcgcttcccttggaagcagtgg 
agcacaggagaggcgacgcaagcagtctctcccaggatcatcaggggatgcgaaccagaatg 
aggaaatcagagcgaaggtcatcgagccttccaatgatccactctctcctcaaacgatctcc 
aaggttgctgaaatgctcccaaagatcaggaccatgccgcctcctgacgagagccctcgcga 
ttccggatgcgccaagcgggt.t:gccgaattggtcgggaagcgctcgttcttcacggctgcag 
ccgaggacgggcgggcgctcgacgtcgaagcacccgaggcggtcgcagaagcttgagatgaa 
ccaccatggtttgatccgttccttccatcagctc 



SEQ ID NO 4: Oryza. satxva seedy 1 protein 

MEEDPLIPLVBVWNNAAFDDSSCSRSAWLPQSPAVAAVRKGDKENHRPEVVDVAAGYDVEAE 
IGHIEAE I LRLS SRLHHLRVSKQPEPNRDDAPMGEMVAKVRPRPRGLSLGPLDVI S I VNREK 
HPLRTKQPPATRGRGLSLGP3«LEIAAANPRVPAAAQHQQQQRAGTARILKPIKEPPVQRRRGV 
SLGPLEIHHGVGSKAPAAARAKPFTTKLNAIREETRPSKQFAVPAKPWPSSNTRQTLDSRQG 
TAAS RAKARS P S PRPRRQSNGKATDTRGGNKWDELKPKGAS S S QS GS AAAAATAKRMAGS S 
KMRVIPSRYSLTPGASLGSSGAQERRRKQSLPGSSGDANQNEEIRAKVIEPSNDPLSPQTIS 
KVAEMLPKIRTMPPPDESPRDSGCAKRVAELVGKRSFFTAAAEDGRALDVEAPEAVAEA 
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SEQ ID NO 5: Medlcago trunculata seedyl encoding sequence 

aaaaacgttaaggactaaaaatataataaaatttaagtagggattcataatggaagcacccc 
tatttacagggatcttaaatataattaaccctaatatttatgacagaaacccttttgaaatc 
acatcggagcgtgtatgagtagccgtttcacatccaacggccagtaagagcgtaactttatt 
tcttccctcttcaatctccaacggtcacataatctcttccaaatacaaataattccctcttt 
caacctcactcttcatttcttcaacccaaacccaaaaaactaatcagattcttcttaaatct 
tgaaacctttctcccaaaagcacttaaataaaaaagcacttaaccatgaataacacaaacaa 
caacaacattcttcttcattccacacaggttcaagtgtggaacaacgcagcattcgatggtg 
aagatttcgccatgaattcatcttctgattccatcaaagagaatctaaacccatccgcattc 
aacattgttccttcttcaaacaaaagaactattgatgatgaaattgcggaaattgaaagtga 
aattaagcgattaacttcgaagctggaattgcttcgtgttgaaaaagctgaaagaaaaatcg 
cttctgaaaagcgtgttagtggaattggtactggaagaatagtagcagcgaagtttatggaa 
ccgaagaaaaacgttacaccgaaacgaaacggtgtcgttttcaaggaggagacaccgaaacg 
aaacggtgtcgtttcggatacgccgaaatctagggttaattggagaagagggatgagtttag 
gtccgatggagattgccgggaaagtgatggcaccgccggcgatgacgattactccggcgacg 
gtgaatcggaggaagtcttgtttctggaaaccgcaggaaagttgtgaagtaatgccgtcggg 
gattactccggcgacggtgaataggaggaaatcttgttttttgaaacctcaagaaagttgtg 
aagaaaatcgaagaaaaacgatttgcaaaccgaatttgaatttgaattcaaattcagttaat 
tctgcggttggatcgattaagcgtgtgaagaagaaagatgaagaaattgctcaggttcaacc 
gaagaagctgtttgaaggtgaaaaatcagtgaagaaatcgttgaaacaaggtagaattgttg 
caagccggtataattccggtggtggtggtggtgatgcgaggaaaagatcgttttcggagaat 
aataagggtttagggagtgaaatcagggctaagaagagatgggagataccaattgaagaagt 
ggatgtgagtggttttgttatgttaccgaagatttcgacaatgaggtttgttgatgagagtc 
ctagagattctggtgctgttaaaagagttgctgaattgaatggaaaaagatcttacttttgt 
gatgaagatgaggaggagagagtgatggtggaggaagaaggtggttctgtttgtcaggtttt 
gaattttgctgaagatgatgatgatgatgatgattatggtgaacaagggtaattgtggaaat 
tggaattgatttgtttttgtggggttgtgtggaactggctatgttctgcttgattcttttgc 
attttggtgtgaaactaaagatgaggtgaaaagtttatgcttgttaaattggattggtttat 
atgttttgaaataataacaacaagcatgtgtcttgcttaataattgtatattgttttgtttg 
ttttataatgatatggatttaatttgtatacacaatataatatagtatgcattgagagagtt 
tttcgttcagtattcattctgattttagtgtttatctcattctagaagattgtattttgttg 



SEQ ID NO 6: Medlcago trvnculata. seedyl protein 

MNNTNNNNILLHSTQVQVWNNA 
AEIESEIKRLTSKLELLRVEKAERK^ 

EETPKRNGWSDTPKSRVmRRGMSLGPMEIAGKVMAPPA^ 

EVMPSGITPATVNRRKSCFLKPQESCEENRRKTICKPNLNLNSNSVNSAVGSIKRVKKKDEE 
IAQVQPKKLFEGEKSVKKSLKQGRIVASRYNSGGGGGDARKRSFSENNKGLGSEIRAKKRWE 
I P I EE VD VS G FVMLPK I S TMRF VDE S PRD S GAVKRVAE LNGKRS YFC DEDEEERVMVEEE GG 
SVCQVLNFAEDDDDDDDYGEQG 
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SEQ ID NO 7: Saccharum sp. encoding sequence (partial 5' end) 

Cgcaccgcgagtttcgaaaaaccaacctatcgcgcctcagatcacgcgaggacgcgagggga 
agcaggaatccctccgctcccagccgcctcctccgctcacccatcgatcgatcgtccgtccg 
gtccagggggctctccggcggcggtggcgatggaggaggacccgctcatcccgctggtgcac 
gtctggaacaacgccgccttcgaccacgcctcctcctccgcgtggcacgcccactcccctgt 
gcccgcgagcgcacgtcgcgaggcggagggggacaaggagaaccaccgccccgaccccgacc 
ccgacgtcgaggcggagatcggccacatcgaggcggagatcctgcgcctgtnctcccgcctn 
caccaccttcgcacctccaagcagtcggagccgtccaagcgcggagaggtcgcgcccgcgcc 
cgcggcgaaggcgaaagcggcggcggcggcgcggctgcggacgcgggggctcagcctgggcc 
cgctcgacgtcgccgctgccggtaaccccaacccgctcaccaccgacaaccagcagcagcag 
ccgcgtgccgcgcagggtctgaagccgatcaagcaggccacggcggcggcgggcaagggcgt 
aagacttgggccccttcgacatggtcggcgcgaaccctagggtccctccgcccn 



SEQ ZD NO 8: Sa.ccha.rum sp. seedy 1 protein (partial N-term) 

MEED PL I PLVHVWNNAAFDHAS S S AWHAHS PVPAS ARREAE GDKENHRPDPDPDVEAE I GHI 
EAEILRLXSRLHHLRTSKQSEPSKRGEVAPAPAAKAKAAAAARLRTRGLSLGPLDVAAAGNP 
NPLTTDNQQQQPRAAQGLKPIKQATAAAGKGVRLGPLRHGRR 



SEQ ID NO 9: Zea Mays seedy 1 encoding sequence (partial 3 'end) 

ccacgcgtccggccgttcgagaggaggaaggccagcgttccaaggagcacgccgtccccgcc 
agaccgtggccatccagcaatgccaggcacccactggatgccaggcaaggcaccgcagcaag 
cagagccaaggcgaggagcgggagcataagccccagcaggttcaggaggcagtccacttcca 
aggctgccgagacaagagcgggaaatgccaagcctacagaggcgacgaggggagggagcgaa 
gcggtcaatcacaccagcaatgtagccacgacgaagaggccggcggggagctccaaggtcag 
ggttgtcccgagccgctacagcatcccacctggctcctccctagcagctgtgacacaaggca 
accgatgcaagcagtctctcccaggatcggctactgagaccagagtaaatctcactgagccg 
ccgaacgacgagttgtctcctgaagaacttgccaaggttgcagagctgctcccaaggattag 
gaccatgccgccttctgatgagagcccgcgtgactcgggatgtgccaagcgtgttgctgatt 
tggtcgggaagcgatccttcttcactgctgcaggggacgatggcaatctcgttacgccctac 
caggcacgggtggttgaacttgaatcacccgaggcagcagcagaagaagcagaagcttgaga 
agtttgtctttgatcaattccgaagtggcttgcatctgggcgtggcctctttttgcagtgtg 
tgctactacatagtctactgttacattcatatcatatcacatttcctattttttcccccttg 
agacattgcttagtacttttgtgttgccttgtgaaaagagagtggaaggttcatctgctgat 
nccttgtt 

SEQ ID NO 10: Zea Mays seedy 1 protein (partial C-term) 

TRPAVREEEGQRSKEHAVPARPWPSSNARHPLDARQGTAASRAKARSGSISPSRFRRQSTSK 
AAETRAGNAKPTEATRGGSEAVNHTSNVATTKRPAGSSKVRVVPSRYSIPPGSSLAAVTQGN 
RCKQSLPGSATETRVNLTEPPNDELSPEELAKVAELLPRIRTMPPSDESPRDSGCAKRVADL 
VGKRSFFTAAGDDGNLVTPYQARWELESPEAAAEEAE 
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SEQ ID NO 11: Arabidospis thaliana seedy 1 coding sequence 

atgacatcaattgaggcaacagaaacgcttaacgctcctccaaagcttcagatctggaacaa 
cgctgccttcgacgatggagattctcaaatcacttccgccatcgaagcttcttcttggtctc 
acctcaacgaatcattcgattccgattgtagcaaggagaatcagtttccgatttcggtttcc 
tcttcgctccaatcctcagtctcgatcaccgaagctccgtcagcaaaatccaagaccgtgaa 
gaccaaatccgccgcagatcggagtaaaaagcgagatatcgatgcagagatcgaagaagtag 
agaaggagatcggacgattatcgacgaaattggagtcgctccgattagagaaggcggagcaa 
accgcaagaagcattgctatacgtggaagaatcgttccggcgaagttcatggaatcatctca 
gaaacaagtgaaattcgacgattcgtgttttacaggatcgaaatcaagagccactcgtagag 
gcgttagtcttggaccagcggagatattcaattccgcgaagaaatctgaaactgtgactcct 
cttcaatcagctcagaatcgacgcaagtcttgtttctttaagcttcctggaatcgaagaagg 
tcaagtgacgacacgaggtaaaggaagaacgagtttgagtctgagtccgagatctcgcaaag 
cgaaaatgacggcagctcagaagcaagcagctacgacggtggggtcaaagagagctgtgaag 
aaagaagaaggagttctcttaacaatccagcctaagaggctattcaaagaagatgaaaagaa 
tgtttctttaaggaaaccattgaaaccaggaagagttgtggctagtaggtacagtcaaatgg 
gtaaaacgcagactggagagaaagatgttaggaaaaggtcgttgcctgaggatgaagagaaa 
gagaatcataagaggtcggagaagagaagagcttctgatgaaagtaacaagagtgaagggag 
agtgaagaagagatgggagattccaagtgaagttgatctgtatagcagtggtgagaacggtg 
acgagtctcctatagttaaggagctacctaagatcagaacgcttcgtcgtgtgggagggagc 
cctcgtgattcaggtgctgctaagagagttgcagaattacaagccaaggatcgtaacttcac 
tttttgccagcttctgaagtttgaagaatgaatgatccgcttatcaatttgagtaaaatcca 
caactcttgttgtggtt 



SEQ ID NO 12: Arabidospis thallana. seedyl protein 

MTSIEATETLNAPPKLQIWNNAAFDDGDSQITSAIEASSWSHLNESFDSDCSKENQFPISVS 
SSLQSSVSITEAPSAKSKTVKTKSAADRSKKRDIDAEIEEVEKEIGRLSTKLESLRLEKAEQ 
TARSIAIRGRIVPAKFMESSQKQVKFDDSCFTGSKSRATRRGVSLGPAEIFNSAKKSETVTP 
LQSAQNRRKSCFFKLPGIEEGQVTTRGKGRTSLSLSPRSRKAKMTAAQKQAATTVGSKRAVK 
KEEGVLLTIQPKRLFKEDEKNVSLRKPLKPGRVVASRYSQMGKTQTGEKDVRKRSLPEDEEK 
ENHKRSEKRRASDESNKSEGRVKKRWEIPSEVDLYSSGENGDESPIVKELPKIRTLRRVGGS 
PRDSGAAKRVAELQAKDRNFTFCQLLKFEE 
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SEQ ID NO 13: Sequence of the [PRO0090 - CDS0689 - terminator] 
expression cassette 

cttctacatcggcttaggtgtagcaacacgactttattattattattattattatitattatt 
attttacaaaaatataaaatagatcagtccctcaccacaagtagagcaagttggtgagttat 
tgtaaagttctacaaagctaatttaaaagttattgcattaacttatttcatattacaaacaa 
gagtgtcaatggaacaatgaaaaccatatgacatactataattttgtttttattattgaaat 
tatataattcaaagagaataaatccacatagccgtaaagttctacatgtggtgcattaccaa 
aatatatatagcttacaaaacatgacaagcttagtttgaaaaattgcaatccttatcacatt 
gacacataaagtgagtgatgagtcataatattattttctttgctacccatcatgtatatatg 
atagccacaaagttactttgatgatgatatcaaagaacatttttaggtgcacctaacagaat 
atccaaataatatgactcacttagatcataatagagcatcaagtaaaactaacactctaaag 
caaccgatgggaaagcatctataaatagacaagcacaatgaaaatcctcatcatccttcacc 
acaattcaaatattatagttgaagcatagtagtaatttaaatcaactagggatatcacaagt 
ttgtacaaaaaagcaggctggtaccggtccggaattcccgggatatcgtcgacccacgcgtc 
cgctgacgcgtgggttccactacatcaagacatctactacactcatcttttttgcacttatt 
gggtgtaaatttttgaaacccagttgagaaaaatgagtgtgttacaatacccagaagggatt 
gacccagcagatgttcagatatggaacaatgcagcatttgataatggagattctgaagattt 
gtcttcgctgaaacgttcttggtctcctctgaaacccctttcggttaggccatcagattcct 
ttgaatctgatttgtcaagtaaggaaaatcaaactcctttatttgagaattcatctgttaat 
ctctcatctccgttacccataaagccacttaaccctaatggggctctggaaaattcaagact 
caagccgaacaagcccaattccaaacagagtcttgatgagatggcggctagaaagagcggaa 
agggaaatgatttccgtgatgagaagaaaatagacgaggaaattgaagaaattcagatggag 
attagtaggttgagttcaagattagaggctttgagaattgaaaaggctgagaaaactgttgc 
taagactgttgaaaagcgaggaagggttgtggcagcaaagtttatggagccaaaacaaagtg 
ttattaagattgaagagcgtatatcaatgagtgcaagaacaaaggtggagcagagaaggggt 
cttagtttaggaccatctgagatttttactggaacgcggcggcgagggttgagtatggggcc 
atcagatattctagcagggacaacaaaggcacggcaattgggaaagcaagagatgattatta 
ctcctattcagccaatacaaaacaggcgaaagtcgtgtttttggaagcttcaagagattgaa 
gaagagggaaaaagttcaagccttagtcctaaatcaagaaaaactgctgcaagaacaatggt 
tacaacaaggcaggcagttactacaattgcatcaaagaagaatttgaaaaaagatgatggac 
ttttgagttcagttcagccaaagaagttgtttaaagatctcgaaaagtctgctgctgctaat 
aagaagccccagaggccggggagggttgtggctagtaggtataatcagagtacaattcagtc 
atcagtagtgagaaagaggtctttacctgaaaatgataaggatgagagtaagagaaatgata 
agaaacggtcgttatctgtagggaaaacgcgtgtgtctcaaactgagagcaagaatttgggt 
actgaaagtagggtgaaaaagagatgggaaattcctagtgagattgtagttcatggaaacac 
agagagtgagaaatctccactaagcattattgtgaagcctgatttgcttccgcgaattagga 
ttgctcggtgtgtgaatgagactcttagggattctggacctgctaaaagaatgatagagttg 
ataggcaagaaatcgtttttcagtagtgatgaagataaggagccacctgtctg-tcaagtttt 
aagttttgcagaggaagatgctgaagaggaataatgtgtaataaagggagctgctaactctt 
ttcatgctctttcaattttcaatcctgccttttaatttttgttcattcgtgccttttaattg 
aatggggaagcattcttttgcttcctcaaactggtattctagcttctgaattacattgtatg 
gtacaat.atgaataaggttttgtcttccggcaggttgtccaagttagtttttagcttaaaat 
agatgcggcagcggccgctctagagtatccctcgaggggcccaagcttacgcgtacccagct 
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ttcttgtacaaagtggtgatatcacaagcccgggcggtcttctagggataacagggtaatta 
tatccctctagatcacaagcccgggcggtcttctacgatgattgagtaataatgtgtcacgc 
atcaccatgggtggcagtgtcagtgtgagcaatgacctgaatgaacaattgaaatgaaaaga 
aaaaaagtactccatctgttccaaattaaaattcattttaaccttttaataggtttatacaa 
taattgatatatgttttctgtatatgtctaatttgttatcatccgggcggtcttctagggat 
aacagggtaattatatccctctagacaacacacaacaaataagagaaaaaacaaataatatt 
aatttgagaatgaacaaaaggaccatatcattcattaactcttctccatccatttccatttc 
acagttcgatagcgaaaaccgaataaaaaacacagtaaattacaagcacaacaaatggtaca 
agaaaaacagttttcccaatgccataatactcgaac 



SEQ ID NO 14: rice prolamin RP6 promoter sequence 

ccttctacatcggcttaggtgtagcaacacgactttattattattattattattattattat 
tattttacaaaaatataaaatagatcagtccctcaccacaagtagagcaagttggtgagtta 
ttgtaaagttctacaaagctaatttaaaagttattgcattaacttatttcatattacaaaca 
agagtgtcaatggaacaatgaaaaccatatgacatactataattttgtttttattattgaaa 
ttatataattcaaagagaataaatccacatagccgtaaagttctacatgtggtgcattacca 
aaatatatatagcttacaaaacatgacaagcttagtttgaaaaattgcaatccttatcacat 
tgacacataaagtgagtgatgagtcataatattatttttcttgctacccatcatgtatatat 
gatagccacaaagttactttgatgatgatatcaaagaacatttttaggtgcacctaacagaa 
tatccaaataatatgactcacttagatcataatagagcatcaagtaaaactaacactctaaa 
gcaaccgatgggaaagcatctataaatagacaagcacaatgaaaatcctcatcatccttcac 
cacaattcaaatattatagttgaagcatagtagtagaatccaacaaca 



SEQ ID NO 15: Motif 1 CORE SEQUENCE 

WXNAXXD 



SEQ ID NO 16: Motif 2 CORE SEQUENCE 

KENXXP 



SEQ ID NO 17: Motif 3 (coiled coil) CORE SEQUENCE 

EX X - 6 EXXRLXXXLXXLR 



SEQ ID NO 18: Motif 4 CORE SEQUENCE 

LPXIXi-ioRDSGXXKRXi-eK 



FIGURE 4 (continued) 



105-Seedyl-EP.ST25 .txt 
SEQUENCE LISTING 



<110> CropDesign N.V. 

<120> Seedyl sequence for making plants having changed growth characteristics 

<130> 105-Seedyl-EP 

<160> 18 

<170> Patentln version 3.1 

<210> 1 
<211> 1428 
<212> DNA 

<213> Nicotiana tabaeum 
<220> 

<221> misc_feature 

<223> seedyl coding sequence (CDS0689) 
<400> 1 

atgagtgtgt tacaataccc agaagggatt gacccagcag atgttcagat atggaacaat 
gcagcatttg ataatggaga ttctgaagat ttgtcttcgc tgaaacgttc ttggtctcct 
ctgaaacccc tttcggttag gccatcagat tcctttgaat ctgat_ttgtc aagtaaggaa 
aatcaaactc ctttatttga gaattcatct gttaatctct catctccgtt acccataaag 
ccacttaacc ctaatggggc tctggaaaat tcaagactca agccgaacaa gcccaattcc 
aaacagagtc ttgatgagat ggcggctaga aagagcggaa agggaaatga tttccgtgat 
gagaagaaaa tagacgagga aattgaagaa attcagatgg agattagtag gttgagttca 
agattagagg ctttgagaat tgaaaaggct gagaaaactg ttgctaagac tgttgaaaag 

Page 1 



60 
120 
180 
240 
300 
360 
420 
480 
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cgaggaaggg ttgtggcagc aaagtttatg gagccaaaac aaagtgttat taagattgaa 540 

gagcgtatat caatgagtgc aagaacaaag gtggagcaga gaaggggtct tagtttagga 600 

ccatctgaga tttttactgg aacgcggcgg cgagggttga gtatggggcc atcagatatt 660 

ctagcaggga caacaaaggc acggcaattg ggaaagcaag agatgattat tactcctatt 720 

cagccaatac aaaacaggcg aaagtcgtgt ttttggaagc ttcaagagat tgaagaagag 780 

ggaaaaagtt caagccttag tcctaaatca agaaaaactg ctgcaagaac aatggttaca 840 

acaaggcagg cagttactac aattgcatca aagaagaatt tgaaaaaaga tgatggactt 900 

ttgagttcag ttcagccaaa gaagttgttt aaagatctcg aaaagtctgc tgctgctaat 960 

aagaagcccc agaggccggg gagggttgtg gctagtaggt ataatcagag tacaattcag 1020 

tcatcagtag tgagaaagag gtctttacct gaaaatgata aggatgagag taagagaaat 1080 

gataagaaac ggtcgttatc tgtagggaaa acgcgtgtgt ctcaaactga gagcaagaat 1140 

ttgggtactg aaagtagggt gaaaaagaga tgggaaattc ctagtgagat tgtagttcat 1200 

ggaaacacag agagtgagaa atctccacta agcattattg tgaagcctga tttgcttccg 1260 

cgaattagga ttgctcggtg tgtgaatgag actcttaggg attctggacc tgctaaaaga 1320 

atgatagagt tgataggcaa gaaatcgttt ttcagtagtg atgaagataa ggagccacct 1380 

gtctgtcaag ttttaagttt tgcagaggaa gatgctgaag aggaataa 1428 

<210> 2 
<211> 475 
<212> PRT 

<213> Nicotiana tabacrum 



<220> 

<2 2 1> MXS COFEATURE 

<223> seedyl protein (CDS0689) 

<400> 2 

Met Ser Val Leu Gin Tyr Pro Glu Gly lie Asp Pro Ala Asp Val Gin 
1*5 10 15 

lie Trp Asn Asn Ala Ala Phe Asp Asn Gly Asp Ser Glu Asp Leu Ser 
20 25 30 
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Ser Leu Lys Arg Ser Trp Ser Pro Leu Lys Pro Leu Ser Val Arg Pro 
35 40 45 



Ser Asp Ser Phe Glu Ser Asp Leu Ser Ser Lys Glu Asn Gin Thr Pro 
50 55 60 



Leu Phe Glu Asn Ser Ser Val Asn Leu Ser Ser Pro Leu Pro He Lys 
65 70 75 80 



Pro Leu Asn Pro Asn Gly Ala Leu Glu Asn Ser Arg Leu Lys Pro Asn 
85 90 95 



Lys Pro Asn Ser Lys Gin Ser Leu Asp Glu Met Ala Ala Arg Lys Ser 
100 105 110 



Gly Lys Gly Asn Asp Phe Arg Asp Glu Lys Lys He Asp Glu Glu He 
115 120 125 



Glu Glu He Gin Met Glu He Ser Arg Leu Ser Ser Arg Leu Glu Ala 
130 135 140 



Leu Arg He Glu Lys Ala Glu Lys Thr Val Ala Lys Thr Val Glu Lys 
145 150 155 160 



Arg Gly Arg Val Val Ala Ala Lys Phe Met Glu Pro Lys Gin Ser Val 
165 170 175 



He Lys He Glu Glu Arg He Ser Met Ser Ala Arg Thr Lys Val Glu 
180 185 190 



Gin Arg Arg Gly Leu Ser Leu Gly Pro Ser Glu He Phe Thr Gly Thr 
195 200 205 



Arg Arg Arg Gly Leu Ser Met Gly Pro Ser Asp He Leu Ala Gly Thr 
210 215 220 - 



Thr Lys Ala Arg Gin Leu Gly Lys Gin Glu Met He He Thr Pro He 
225 230 235 240 



Gin Pro He Gin Asn Arg Arg Lys Ser Cys Phe Trp Lys Leu Gin Glu 
245 250 255 



He Glu Glu Glu Gly Lys Ser Ser Ser Leu Ser Pro Lys Ser Arg Lys 
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260 265 270 



Thr Ala Ala Arg Thr Met Val Thr Thr Arg Gin Ala Val Thr Thr lie 
275 280 285 



Ala Ser Lys Lys Asn Leu Lys Lys Asp Asp Gly Leu Leu Ser Ser Val 
290 295 ^ 300 



Gin Pro Lys Lys Leu Phe Lys Asp Leu Glu Lys Ser Ala Ala Ala Asn 
305 310 315 320 

Lys Lys Pro Gin Arg Pro Gly Arg Val Val Ala Ser Arg Tyr Asn Gin 
325 330 335 



Ser Thr lie Gin Ser Ser Val Val Arg Lys Arg Ser Leu Pro Glu Asn 
340 345 350 

Asp Lys Asp Glu Ser Lys Arg Asn Asp Lys Lys Arg Ser Leu Ser Val 
355 360 365 

Gly Lys Thr Arg Val Ser Gin Thr Glu Ser Lys Asn Leu Gly Thr Glu 
370 375 380 

Ser Arg Val Lys Lys Arg Trp Glu He Pro Ser Glu He Val Val His 
385 390 395 400 

Gly Asn Thr Glu Ser Glu Lys Ser Pro Leu Ser lie lie Val Lys Pro 
405 410 415 



Asp Leu Leu Pro Arg lie Arg Xle Ala Arg Cys Val Asn Glu Thr Leu 
420 425 430 



Arg Asp Ser Gly Pro Ala Lys Arg Met He Glu Leu He Gly Lys Lys 
435 440 445 

Ser Phe Phe Ser Ser Asp Glu Asp Lys Glu Pro Pro Val Cys Gin Val 
450 455 ^ 460 



Leu Ser Phe Ala Glu Glu Asp Ala Glu Glu Glu 
465 470 475 



<210> 3 
<211> 1336 
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<212> DNA 

<213> Oryza sativa 

<220> 

<221> miscjfeature 

<223> seedyl coding sequence 



<400> 3 
atggaggagg 


acccgctcat 


cccgctggtc 


cacgtctgga 


acaacgccgc 


cttcgacgac 


60 


tcctcgtgtt 


ccagatcggc 


ttggctcccc 


caaagccccg 


ccgtcgcggc 


cgtccgcaag 


120 


ggcgacaagg 


agaatcaccg 


ccccgaggtt 


gttgatgtcg 


ccgccggcta 


cgacgtcgag 


180 


gccgagatcg 


gccacatcga 


ggcggagatc 


ctgcgcctct 


cgtcccggct 


ccaccatctc 


240 


cgcgtctcca 


agcagccgga 


gcccaaccgc 


gacgacgctc 


cgatggggga 


gatggtcgcg 


300 


aaggtgaggc 


cccggccgag 


gggcctcagc 


ctcgggcccc 


tggatgtgat 


ctccatcgtc 


360 


aatcgtgaga 


agcatccgct 


gcgcaccaag 


cagcctccgg 


cgacgcgggg 


cagggggctc 


420 


agcctcgggc 


ccatggagat 


cgccgcggcg 


aaccctaggg 


tgcccgcggc 


ggcgcagcat 


480 


cagcaacagc 


aacgcgctgg 


cacggcgcgg 


atcctgaagc 


caatcaagga 


gcctccggtg 


540 


cagcgtcgca 


ggggcgtcag 


cctcgggccg 


ttggagatcc 


accacggcgt 


cggcagcaag 


600 


gcaccagcgg 


cggcgcgagc 


caagccgttc 


accaccaagc 


tcaacgccat 


tcgagaagaa 


660 


acccgaccct 


ccaagcaatt 


cgccgtcccc 


gccaagccat 


ggccgtcgag 


caatacaagg 


720 


cagacactgg 


actcgaggca 


aggaacagca 


gcaagtcgag 


cgaaggcgag 


gagcccgagc 


780 


cccaggccca 


ggaggcaatc 


caatggcaag 


gctactgaca 


caaggggagg 


caacaaggtg 


840 


gtggatgagc 


tcaagcccaa 


aggtgcgtcg 


tcaagtcaga 


gcggcagcgc 


cgccgccgcc 


900 


gccactgcca 


agaggatggc 


ggggagctcc 


aagatgaggg 


tcatcccgag 


ccgctacagc 


960 


ctcactcctg 


gcgcttccct 


tggaagcagt 


ggagcacagg 


a^gaggcgacg 


caagcagtct- 


1020 


ctcccaggat 


catcagggga 


tgcgaaccag 


aatgaggaaa 


tcagagcgaa 


ggtcatcgag 


1080 


ccttccaatg 


atccactctc 


tcctcaaacg- 


atctccaagg 


ttgctgaaat 


gctcccaaag 


1140 


atcaggacca 


tgccgcctcc 


tgacgagagc 


cctcgcgatt 


ccggatgcgc 


caagcgggtt 


1200 


gccgaattgg 


tcgggaagcg 


ctcgttcttc 


acggctgcag 


ccgaggacgg 


gcgggcgctc 


1260 


gacgtcgaag 


cacccgaggc 


ggtcgcagaa 


gcttgagatg 
Page 


aaccaccatg 
5 


gtttgatccg 


1320 



ttccttccat cagctc 
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1336 



<210> 4 

<211> 431 

<212> PRT 

<213> Oryza sativa 



<220> 

<221> MIS COFEATURE 
<223> seedyl protein 



<400> 4 

Met Glu Glu Asp Pro Leu He Pro Leu Val His Val Trp Asn Asn Ala 



10 

1 3 



Ala Phe Asp Asp Ser Ser Cys Ser Arg Ser Ala Trp Leu Pro Gin Ser 
20 25 

Pro Ala Val Ala Ala Val Arg Lys Gly Asp Lys Glu Asn His Arg Pro 
35 40 

, ,i Tvi = n„ Tvr Ran Val Glu Ala Glu He Gly 
Glu val Val Asp Val Ala Ala Gly Tyr Asp vaj. ^ 

50 55 

His He Glu Ala Glu He Leu Arg Leu Ser Ser Arg Leu His His Leu 
65 70 

^ Val ser Lys Gin Pro Glu Pro Asn *, Asp Asp Ala Pro Met Gly 
85 

Glu Her vsl Ala Lys Val AT, Pro Arg Pro Arg Gly leu ser leu Gly 
100 105 

Er „ Leu Asp vsl lie ser He V.1 Asn Arg Glu Lys His Pro Leu Ar, 
115 120 

Th r Lys Gin Pro Pro Ala Thr Arg Gly Arg Gly Leu Ser Leu Gly Pro 



130 135 
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Met Glu lie Ala Ala Ala Asn Pro Arg Val Pro Ala Ala Ala Gin His 
145 150 155 160 



Gin Gin Gin Gin Arg Ala Gly Thr Ala Arg lie Leu Lys Pro He Lys 
165 170 175 



Glu Pro Pro Val Gin Arg Arg Arg Gly Val Ser Leu Gly Pro Leu Glu 
180 185 190 

He His His Gly Val Gly Ser Lys Ala Pro Ala Ala Ala Arg Ala Lys 
195 200 205 

Pro Phe Thr Thr Lys Leu Asn Ala He Arg Glu Glu Thr Arg Pro Ser 
210 215 220 



Lys Gin Phe Ala Val Pro Ala Lys Pro Trp Pro Ser Ser Asn Thr Arg 
225 230 235 240 



Gin Thr Leu Asp Ser Arg Gin Gly Thr Ala Ala Ser Arg Ala Lys Ala 
245 250 255 



Arg Ser Pro Ser Pro Arg Pro Arg Arg Gin Ser Asn Gly Lys Ala Thr 
260 265 270 



Asp Thr Arg Gly Gly Asn Lys Val Val Asp Glu Leu Lys Pro Lys Gly 
275 280 285 



Ala Ser Ser Ser Gin Ser Gly Ser Ala Ala Ala Ala Ala Thr Ala Lys 
290 295 300 



Arg Met Ala Gly Ser Ser Lys Met Arg Val He Pro Ser Arg Tyr Ser 
305 310 315 320 



Leu Thr Pro Gly Ala Ser Leu Gly Ser Ser Gly Ala Gin Glu Arg Arg 
325 330 335 



Arg Lys- Gin Ser Leu Pro Gly "Ser Ser Gly Asp Ala Asn Gin Asn Glu 
340 345 350 



Glu He Arg Ala Lys Val lie Glu Pro Ser Asn Asp Pro Leu Ser Pro 
355 360 365 



Gin Thr He Ser Lys Val Ala Glu Met Leu Pro Lys He Arg Thr Met 
370 375 380 
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Pro Pro Pro Asp Glu Ser Pro Arg Asp Ser Gly Cys Ala Lys Arg Val 
385 390 395 400 

Ala Glu Leu Val Gly Lys Arg Ser Phe Phe Thr Ala Ala Ala Glu Asp 
405 410 415 

Gly Arg Ala Leu Asp Val Glu Ala Pro Glu Ala Val Ala Glu Ala 
420 425 430 

<210> 5 
<211> 1860 
<212> DNA 

<213> Medicago trunculata 



<220> 

<221> misc_feature 

<223> seedy 1 coding sequence 

<400> 5 

aaaaacgtta aggactaaaa atataataaa atttaagtag ggattcataa tggaagcacc 60 

cctatttaca gggatcttaa atataattaa ccctaatatt tatgacagaa acccttttga 120 

aatcacatcg gagcgtgtat gagtagccgt ttcacatcca acggccagta agagcgtaac 180 

tttatttctt ccctcttcaa tctccaacgg tcacataatc tcttccaaat acaaataatt 240 

ccctctttca acctcactct tcatttcttc aacccaaacc caaaaaacta atcagattct 300 

tcttaaatct tgaaaccttt ctcccaaaag cacttaaata aaaaagcact taaccatgaa 360 

taacacaaac aacaacaaca ttcttcttca ttccacacag gttcaagtgt ggaacaacgc 420 

agcattcgat ggtgaagatt tcgccatgaa ttcatcttct gattccatca aagagaatct 480 

aaacccatcc gcattcaaca ttgttccttc ttcaaacaaa agaactattg atgatgaaat 540 

tgcggaaatt gaaagtgaaa ttaagcgatt aacttcgaag ctggaattgc ttcgtgttga 600 

aaaagctgaa agaaaaatcg cttctgaaaa gcgtgttagt ggaattggta ctggaagaat 660 

agtagcagcg aagtttatgg aaccgaagaa aaacgttaca ccgaaacgaa acggtgtcgt 720 

tttcaaggag gagacaccga aacgaaacgg tgtcgtttcg gatacgccga aatctagggt 780 

taattggaga agagggatga gtttaggtcc gatggagatt gccgggaaag tgatggcacc 840 
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gccggcgatg 


acgattactc 


cggcgacggt 


gaatcggagg 


aagtcttgtt tctggaaacc 




gcaggaaagt 


tgtgaagtaa 


tgccgtcggg 


gattactccg 


gcgacggtga ataggaggaa 


Q C f\ 

you 


atcttgtttt 


ttgaaacctc 


aagaaagttg 


tgaagaaaat 


cgaagaaaaa cgatttgcaa 


1020 


accgaatttg 


aatttgaatt 


caaattcagt 


taattctgcg 


gttggatcga ttaagcgtgt 


1080 


gaagaagaaa 


gatgaagaaa 


ttgctcaggt 


tcaaccgaag 


aagctgtttg aaggtgaaaa 


1140 


atcagtgaag 


aaatcgttga 


aacaaggtag 


aattgttgca 


agccggtata attccggtgg 


1200 


tggtggtggt 


gatgcgagga 


aaagatcgtt 


ttcggagaat 


aataagggtt tagggagtga 


1260 


aatcagggct 


aagaagagat 


gggagatacc 


aattgaagaa 


gtggatgtga gtggttttgt 


1320 


tatgttaccg 


aagatttcga 


caatgaggtt 


tgttgatgag 


agtcctagag attctggtgc 


1380 


tgttaaaaga 


gttgctgaat 


tgaatggaaa 


aagatcttac 


ttttgtgatg aagatgagga 


1440 


ggagagagtg 


atggtggagg 


aagaaggtgg 


ttctgtttgt 


caggttttga attttgctga 


1500 


agatgatgat 


gatgatgatg 


attatggtga 


acaagggtaa 


ttgtggaaat tggaattgat 


1560 


ttgtttttgt 


ggggttgtgt 


ggaactggct 


atgttctgct 


tgattctttt gcattttggt 


1620 


gtgaaactaa 


agatgaggtg 


aaaagtttat 


gcttgttaaa 


ttggattggt ttatatgttt 


1680 


tgaaataata 


acaacaagca 


tgtgtcttgc 


ttaataattg 


tatattgttt tgtttgtttt 


1740 


ataatgatat 


ggatttaatt 


tgtatacaca 


atataatata 


gtatgcattg agagagtttt 


1800 


tcgttcagta 


ttcattctga 


ttttagtgtt 


tatctcattc 


tagaagattg tattttgttg 


1860 



<210> 6 

<211> 394 

<212> PRT 

<213> Medicago trunculata 
<220> 

<22X> MIS COFEATURE ""~ 

<223> seedyl protein 

<400> 6 

Met Asn Asn Thr Asn Asn Asn Asn He Leu Leu His Ser Thr Gin Val 
15 10 15 
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Gin Val Trp Asn Asn Ala Ala Phe Asp Gly Glu Asp Phe Ala Met Asn 
20 25 30 

Ser Ser Ser Asp Ser He Lys Glu Asn Leu Asn Pro Ser Ala Phe Asn 
35 ~ 40 45 

He Val Pro Ser Ser Asn I»ys Arg Thr He Asp Asp Glu lie Ala Glu 
50 55 60 

He Glu Ser Glu He Lys Arg Leu Thr Ser Lys Leu Glu Leu Leu Arg 
65 70 75 80 

Val Glu Lys Ala Glu Arg Lys He Ala Ser Glu Lys Arg Val Ser Gly 
85 90 95 

He Gly Thr Gly Arg He Val Ala Ala Lys Phe Met Glu Pro Lys Lys 
100 " 105 110 

Asn Val Thr Pro Lys Arg Asn Gly Val Val Phe Lys Glu Glu Thr Pro 
115 120 125 

Lys Arg Asn Gly Val Val Ser Asp Thr Pro Lys Ser Arg Val Asn Trp 
130 135 140 

Arg Arg Gly Met Ser Leu Gly Pro Met Glu He Ala Gly Lys Val Met 
145 150 155 160 

Ala Pro Pro Ala Met Thr He Thr Pro Ala Thr Val Asn Arg Arg Lys 
165 170 175 

Ser Cys Phe Trp Lys Pro Gin Glu Ser Cys Glu Val Met Pro Ser Gly 
180 185 190 

He Thr Pro Ala Thr Val Asn Arg Arg Lys Ser Cys Phe Leu Lys Pro 
195 200 205 

Gin Glu Ser Cys Glu Glu Asn Arg Arg Lys Thr He Cys Lys Pro Asn 
210 215 220 

Leu Asn Leu Asn Ser Asn Ser Val Asn Ser Ala Val Gly Ser He Lys 
225 230 235 240 

Arg Val Lys Lys Lys Asp Glu Glu He Ala Gin Val Gin Pro Lys Lys 
245 250 255 
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Leu Phe Glu Gly Glu Lys Ser Val Lys Lys Ser Leu Lys Gin Gly Arg 
260 265 270 

lie Val Ala Ser Arg Tyr Asn Ser Gly Gly Gly Gly Gly Asp Ala Arg 
275 280 285 



Lys Arg Ser Phe Ser Glu Asn Asn Lys Gly Leu Gly Ser Glu lie Arg 
290 295 300 

Ala Lys Lys Arg Trp Glu He Pro He Glu Glu Val Asp Val Ser Gly 

305 ' 310 315 320 



Phe Val Met Leu Pro Lys He Ser Thr Met Arg Phe Val Asp Glu Ser 
325 330 335 



Pro Arg Asp Ser Gly Ala Val Lys Arg Val Ala Glu Leu Asn Gly Lys 
340 345 350 



Arg Ser Tyr Phe Cys Asp Glu Asp Glu Glu Glu Arg Val Met Val Glu 
355 360 365 

Glu Glu Gly Gly Ser Val Cys Gin Val Leu Asn Phe Ala Glu Asp Asp 
370 375 380 



Asp Asp Asp Asp Asp Tyr Gly Glu Gin Gly 
385 390 



<210> 7 

<211> 674 

<212> DNA 

<213> Saccharum sp. 

<220>. . - - ' 

<221> misc_feature 

<223> seedyl coding sequence (partial 5' end) 
<220> 

<221> mis cofeature 
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<222> (362).. (362) 

<223> n can be a, c, g or t 

<220> 

<221> misc_feature 

<222> (372) . . (372) 

<223> n can be a, c, g or t 

<220> 

<221> misc_feature 

<222> (674) . . (674) 

<223> n can be a, c, g or t 



<400> 7 






60 


cgcaccgcga 


gtttcgaaaa accaacctat cgcgcctcag 


atcacgcgag gacgcgaggg 


gaagcaggaa 


tccctccgct. cccagccgcc tcctccgctc 


acccatcgat cgatcgtccg 


120 


tccggtccag 


ggggctctcc ggcggcggtg gcgatggagg 


aggacccgct, catcccgctg 


180 


gtgcacgtct 


ggaacaacgc cgccttcgac cacgcctcct 


cctccgcgtg gcacgcccac 


240 


tcccctgtgc 


ccgcgagcgc acgtcgcgag gcggaggggg 


acaaggagaa ccaccgcccc 


300 


gaccccgacc 


ccgacgtcga ggcggagatc ggccacatcg 


aggcggagat cctgcgcctg 


360 


tnctcccgcc 


tncaccacct tcgcacctcc aagcagtcgg 


agccgtccaa gcgcggagag 


420 


gtcgcgcccg 


cgcccgcggc gaaggcgaaa gcggcggcgg 


cggcgcggct gcggacgcgg 


480 


gggctcagcc 


tgggcccgct cgacgtcgcc gctgccggta 


accccaaccc gctcaccacc 


540 


gacaaccagc 


agcagcagcc gcgtgccgcg cagggtctga 


agccgatcaa gcaggccacg 


600 


gcggcggcgg 


gcaagggcgt aagacttggg ccccttcgac 


atggtcggcg cgaaccctag 


660 


ggtccctccg 


cccn 




674 



<210> 8 
<211> 166 
<212> PRT 
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<213> Saccharum sp. 



<220> 

<221> MISC__FEATURE 

<223> seedyl protein 

<220> 

<221> MI S COFEATURE 

<223> seedyl protein (partial N term) 
<220> 

<221> MI S COFEATURE 

<222> (70).. (70) 

<223> Xaa can be any amino acid 

<400> 8 

Met Glu Glu Asp Pro Leu lie Pro Leu Val His Val Trp Asn Asn Ala 

15 10 15 



Ala Phe Asp His Ala Ser Ser Ser Ala Trp His Ala His Ser Pro Val 
20 25 30 



Pro Ala Ser Ala Arg Arg Glu Ala Glu Gly Asp Lys Glu Asn His Arg 
35 40 45 



Pro Asp Pro Asp Pro Asp Val Glu Ala Glu He Gly His He Glu Ala 
50 55 60 



Glu lie Leu Arg -Leu Xaa Ser Arg Leu His His Leu Arg Thr Ser Lys 
65 70 " 75 80 



Gin Ser Glu Pro Ser Lys Arg Gly Glu Val Ala Pro Ala Pro Ala Ala 
85 90 95 



Lys Ala Lys Ala Ala Ala Ala Ala Arg Leu Arg Thr Arg Gly Leu Ser 
100 105 110 
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Leu Gly Pro Leu Asp Val Ala Ala Ala Gly Asn Pro Asn Pro Leu Thr 
115 120 125 

Thr Asp Asn Gin Gin Gin Gin Pro Arg Ala Ala Gin Gly Leu Lys Pro 
130 135 140 

He Lys Gin Ala Thr Ala Ala Ala Gly Lys Gly Val Arg Leu Gly Pro 
145 150 155 160 

Leu Arg His Gly Arg Arg 
165 

<210> 9 
<211> 876 
<212> DNA 
<213> Zea Mays 

<220> 

<221> mis cofeature 

<223> seedyl coding sequence (partila 3" end) 
<220> 

<221> misc_feature 

<222> (869) . . (869) 



<223> n = 


a, c, g or 


t 






<400> 9 
ccacgcgtcc 


ggccgttcga 


gaggaggaag gccagcgttc caaggagcac 


gccgtccccg 


60 


ccagaccgtg 


gccatccagc 


aatgccaggc acccactgga tgccaggcaa 


ggcaccgcag 


120 


caagcagagc 


caaggcgagg 


agcgggagca taagccccag caggttcagg 


aggcagtcca 


180 


cttccaaggc 


tgccgagaca 


agagcgggaa atgccaagcc tacagaggcg 


acgaggggag 


240 


ggagcgaagc 


ggtcaatcac 


accagcaatg tagccacgac gaagaggccg 


gcggggagct 


300 


ccaaggtcag ggttgtcccg 


agccgctaca gcatcccacc tggctcctcc 


ctagcagctg 


360 


tgacacaagg 


caaccgatgc 


aagcagtctc tcccaggatc ggctactgag accagagtaa 


420 
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atctcactga 


gccgccgaac 


gacgagttgt 


ctcctgaaga acttgccaag gttgcagagc 


480 


tgctcccaag 


gattaggacc 


atgccgcctt 


ctgatgagag cccgcgtgac tcgggatgtg 


540 


ccaagcgtgt 


tgctgatttg 


gtcgggaagc 


gatccttctt cactgctgca ggggacgatg 


600 


gcaatctcgt 


tacgccctac 


caggcacggg 


tggttgaact tgaatcaccc gaggcagcag 


660 


cagaagaagc 


agaagcttga 


gaagtttgtc 


tttgatcaat tccgaagtgg cttgcatctg 


720 


ggcgtggcct 


ctttttgcag 


tgtgtgctac 


tacatagtct actgttacat tcatatcata 


780 


tcacatttcc 


tattttttcc 


cccttgagac 


attgcttagt acttttgtgt tgccttgtga 


840 


aaagagagtg 


gaaggttcat 


ctgctgatnc 


cttgtt 


876 



<210> 10 

<211> 224 

<212> PRT 

<213> Zea Mays 



<220> 

<221> MIS COFEATURE 

<223> seedyl protein (partial C term) 



<400> 10 

Thr Arg Pro Ala Val Arg Glu Glu Glu Gly Gin Arg Ser Lys Glu His 
15 10 15 

Ala Val Pro Ala Arg Pro Trp Pro Ser Ser Asn Ala Arg His Pro Leu 
20 25 30 

Asp Ala Arg Gin Gly Thr Ala Ala Ser Arg Ala Lys Ala Arg Ser Gly 

35 40 . . 45 ... 

Ser lie Ser Pro Ser Arg Phe Arg Arg Gin Ser Thr Ser Lys Ala Ala 
50 55 60 



Glu Thr Arg Ala Gly Asn Ala Lys Pro Thr Glu Ala Thr Arg Gly Gly 
65 70 75 80 



Ser Glu Ala Val Asn His Thr Ser Asn Val Ala Thr Thr Lys Arg Pro 
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85 90 95 



Ala Gly Ser Ser Lys Val Arg Val Val Pro Ser Arg Tyr Ser He Pro 
100 105 110 

Pro Gly Ser Ser Leu Ala Ala Val Thr Gin Gly Asn Arg Cys Lys Gin 
115 120 125 

Ser Leu Pro Gly Ser Ala Thr Glu Thr Arg Val Asn Leu Thr Glu Pro 
130 135 140 

Pro Asn Asp Glu Leu Ser Pro Glu Glu Leu Ala Lys Val Ala Glu Leu 
145 150 155 160 

Leu Pro Arg He Arg Thr Met Pro Pro Ser Asp Glu Ser Pro Arg Asp 
165 170 175 

Ser Gly Cys Ala Lys Arg Val Ala Asp Leu Val Gly Lys Arg Ser Phe 
180 185 190 

Phe Thr Ala Ala Gly Asp Asp Gly Asn Leu Val Thr Pro Tyr Gin Ala 
195 * 200 205 

Arg Val Val Glu Leu Glu Ser Pro Glu Ala Ala Ala Glu Glu Ala Glu 
210 215 220 

<210> 11 

<211> 1257 

<212> DNA 

<213> Arabidospis thaliana 
<220> 

<221> miscjEeature 

<223> seedyl coding sequence 

<400> 11 

atgacatcaa ttgaggcaac agaaacgctt aacgctcctc caaagcttca gatctggaac 60 

aacgctgcct tcgacgatgg agattctcaa atcacttccg ccatcgaagc ttcttcttgg 120 

tctcacctca acgaatcatt cgattccgat tgtagcaagg agaatcagtt tccgatttcg 180 
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gtttcctctt 


cgctccaatc 


ctcagtctcg 


atcaccgaag 


ctccgtcagc aaaatccaag 


240 


accgtgaaga 


ccaaatccgc 


cgcagatcgg 


agtaaaaagc 


gagatatcga tgcagagatc 


o f\ n 
300 


gaagaagtag 


agaaggagat 


cggacgatta 


tcgacgaaat 


tggagtcgct ccgattagag 


O £ f\ 

360 


aaggcggagc 


aaaccgcaag 


aagcattgct 


atacgtggaa 


gaatcgttcc ggcgaagttc 


420 


atggaatcat 


ctcagaaaca 


agtgaaattc 


gacgattcgt 


gttttacagg atcgaaatca 


480 


agagccactc 


gtagaggcgt 


tagtcttgga 


ccagcggaga 


tattcaattc cgcgaagaaa 


540 


tctgaaactg 


tgactcctct 


tcaatcagct 


cagaatcgac 


gcaagtcttg tttctttaag 


600 


cttcctggaa 


tcgaagaagg 


tcaagtgacg 


acacgaggta 


aaggaagaac gagtttgagt 


660 


ctgagtccga 


gatctcgcaa 


agcgaaaatg 


acggcagctc 


agaagcaagc agctacgacg 


720 


gtggggtcaa 


agagagctgt 


gaagaaagaa 


gaaggagttc 


tcttaacaat ccagcctaag 


780 


aggctattca 


aagaagatga 


aaagaatgtt 


tctttaagga 


aaccattgaa accaggaaga 


840 


gttgtggcta 


gtaggtacag 


tcaaatgggt 


aaaacgcaga 


ctggagagaa agatgttagg 


900 


aaaaggtcgt 


tgcctgagga 


tgaagagaaa 


gagaatcata 


agaggtcgga gaagagaaga 


960 


gcttctgatg 


aaagtaacaa 


gagtgaaggg 


agagtgaaga 


agagatggga gattccaagt 


1020 


gaagttgatc 


tgtatagcag 


tggtgagaac 


ggtgacgagt 


ctcctatagt taaggagcta 


1080 


cctaagatca 


gaacgcttcg 


tcgtgtggga 


gggagccctc 


gtgattcagg tgctgctaag 


1140 


agagttgcag 


aattacaagc 


caaggatcgt 


aacttcactt 


tttgccagct tctgaagttt 


1200 


gaagaatgaa 


tgatccgctt 


atcaatttga 


gtaaaatcca 


caactcttgt tgtggtt 


1257 



<210> 12 

<211> 402 

<212> PRT 

<213> Arabidospis thaliana 

<220> ... -■ 

<221> MIS COFEATURE 
<223> seedyl protein 

<400> 12 

Met Thr Ser lie Glu Ala Thr Glu Thr Leu Asn Ala Pro Pro Lys Leu 
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10 15 



Gin lie Trp Asn Asn Ala Ala Phe Asp Asp Gly Asp Ser Gin lie Thr 
20 25 30 



Ser Ala lie Glu Ala Ser Ser Trp Ser His Leu Asn Glu Ser Phe Asp 
35 40 45 



Ser Asp Cys Ser Lys Glu Asn Gin Phe Pro lie Ser Val Ser Ser Ser 
50 55 60 

Leu Gin Ser Ser Val Ser lie Thr Glu Ala Pro Ser Ala Lys Ser Lys 
65 70 75 80 



Thr Val Lys Thr Lys Ser Ala Ala Asp Arg Ser Lys Lys Arg Asp lie 
85 90 95 

Asp Ala Glu He Glu Glu Val Glu Lys Glu He Gly Arg Leu Ser Thr 
100 105 110 

Lys Leu Glu Ser Leu Arg Leu Glu Lys Ala Glu Gin Thr Ala Arg Ser 
115 120 ~ 125 

He Ala He Arg Gly Arg He Val Pro Ala Lys Phe Met Glu Ser Ser 
130 135 140 

Gin Lys Gin Val Lys Phe Asp Asp Ser Cys Phe Thr Gly Ser Lys Ser 
145 150 155 160 

Arg Ala Thr Arg Arg Gly Val Ser Leu Gly Pro Ala Glu He Phe Asn 
165 170 175 



Ser Ala Lys Lys Ser Glu Thr Val Thr Pro Leu Gin Ser Ala Gin Asn 

180 185 190 

Arg Arg Lys Ser Cys Phe Phe Lys Leu Pro Gly He Glu Glu Gly Gin 

195 200 205 



Val Thr Thr Arg Gly Lys Gly Arg Thr Ser Leu Ser Leu Ser Pro Arg 
210 ~ 215 220 

Ser Arg Lys Ala Lys Met Thr Ala Ala Gin Lys Gin Ala Ala Thr Thr 
225 " 230 235 240 
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Val Gly Ser Lys Arg Ala Val Lys Lys Glu Glu Gly Val Leu Leu Thr 
245 250 ' 255 



He Gin Pro Lys Arg Leu Phe Lys Glu Asp Glu Lys Asn Val Ser Leu 
260 265 270 

Arg Lys Pro Leu Lys Pro Gly Arg Val Val Ala Ser Arg Tyr Ser Gin 
275 280 285 

Met Gly Lys Thr Gin Thr Gly Glu Lys Asp Val Arg Lys Arg Ser Leu 
290 295 300 

Pro Glu Asp Glu Glu Lys Glu Asn His Lys Arg Ser Glu Lys Ara Ara 
305 310 315 320 

Ala Ser Asp Glu Ser Asn Lys Ser Glu Gly Arg Val Lys Lys Arg Trp 
325 330 ~ 335 

Glu He Pro Ser Glu Val Asp Leu Tyr Ser Ser Gly Glu Asn Gly Asp 
340 345 350 

Glu Ser Pro He Val Lys Glu Leu Pro Lys He Arg Thr Leu Ara Ara 
355 360 365 

Val Gly Gly Ser Pro Arg Asp Ser Gly Ala Ala Lys Arg Val Ala Glu 
370 375 380 

Leu Gin Ala Lys Asp Arg Asn Phe Thr Phe Cys Gin Leu Leu Lvs Phe 
385 390 395 400 

Glu Glu 



<210> 13 

<211> 3074 

"<212>~ DNA " 

<213> Artificial sequence 



<220> 

<223> Sequence of the [PRO0090 - CDS0689 - terminator] expression casse 
tte 
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<400> 13 



60 
120 
180 
240 
300 
360 
420 



cttctacatc ggcttaggtg tagcaacacg actttattat tattattatt attattatta 
ttattttaca aaaatataaa atagatcagt ccctcaccac aagtagagca agttggtgag 
ttattgtaaa gttctacaaa gctaatttaa aagttattgc attaacttat ttcatattac 
aaacaagagt gtcaatggaa caatgaaaac catatgacat actataattt tgtttttatt 
attgaaatta tataattcaa agagaataaa tccacatagc cgtaaagttc tacatgtggt 
gcattaccaa aatatatata gcttacaaaa catgacaagc ttagtttgaa aaattgcaat 
ccttatcaca ttgacacata aagtgagtga tgagtcataa tattattttc tttgctaccc 
atcatgtata tatgatagcc acaaagttac tttgatgatg atatcaaaga acatttttag 480 
gtgcacctaa cagaatatcc aaataatatg actcacttag atcataatag agcatcaagt 
aaaactaaca ctctaaagca accgatggga aagcatctat aaatagacaa gcacaatgaa 
aatcctcatc atccttcacc acaattcaaa tattatagtt gaagcatagt agtaatttaa 
atcaactagg gatatcacaa gtttgtacaa aaaagcaggc tggtaccggt ccggaattcc 720 
cgggatatcg tcgacccacg cgtccgctga cgcgtgggtt ccactacatc aagacatcta 
ctacactcat cttttttgca cttattgggt gtaaattttt gaaacccagt tgagaaaaat 
gagtgtgtta caatacccag aagggattga cccagcagat gttcagatat ggaacaatgc 
agcatttgat aatggagatt ctgaagattt gtcttcgctg aaacgttctt ggtctcctct 
gaaacccctt tcggttaggc catcagattc ctttgaatct gatttgtcaa gtaaggaaaa 
tcaaactcct ttatttgaga attcatctgt taatctctca tctccgttac ccataaagcc 
acttaaccct aatggggctc tggaaaattc aagactcaag ccgaacaagc ccaattccaa 
acagagtctt gatgagatgg cggctagaaa gagcggaaag ggaaatgatt tccgtgatga 
gaagaaaata gacgaggaaa ttgaagaaat tcagatggag attagtaggt tgagttcaag 
attagaggct ttgagaattg aaaaggctga gaaaactgtt gctaagactg ttgaaaagcg 1320 
aggaagggtt gtggcagcaa agtttatgga gccaaaacaa agtgttatta agattgaaga 1380 
gcgtatatca atgagtgcaa gaacaaaggt ggagcagaga aggggtctta gtttaggacc 1440 
atctgagatt tttactggaa cgcggcggcg agggttgagt atggggccat cagatattct 1500 
agcagggaca acaaaggcac ggcaattggg aaagcaagag atgattatta ctcctattca 
gccaatacaa aacaggcgaa agtcgtgttt ttggaagctt caagagattg aagaagaggg 
aaaaagttca agccttagtc ctaaatcaag aaaaactgct gcaagaacaa tggttacaac 1680 
aaggcaggca gttactacaa ttgcatcaaa gaagaatttg aaaaaagatg atggactttt 
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540 
600 
660 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



1560 
1620 



1740 



gagttcagtt 
gaagccccag 
atcagtagtg 
taagaaacgg 
gggtactgaa 
aaacacagag 
aattaggatt 
gatagagttg 
ctgtcaagtt 
agctgctaac 
tcgtgccttt 
tctgaattac 
gttagttttt 
ccaagcttac 
cttctaggga 
gatgattgag 
acctgaatga 
tcattttaac 
atttgttatc 
cacacaacaa 
atcattcatt 
aaaaaacaca 
cataatactc 



cagccaaaga 
aggccgggga 
agaaagaggt 
tcgttatctg 
agtagggtga 
agtgagaaat 
gctcggtgtg 
ataggcaaga 
ttaagttttg 
tcttttcatg 
taattgaatg 
attgtatggt 
agcttaaaat 
gcgtacccag 
taacagggta 
taataatgtg 
acaattgaaa 
cttttaatag 
atccgggcgg 
ataagagaaa 
aactcttctc 
gtaaattaca 
gaac 
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agttgtttaa agatctcgaa aagtctgctg 



gggttgtggc tagtaggtat aatcagagta 
ctttacctga aaatgataag gatgagagta 
tagggaaaac gcgtgtgtct caaactgaga 
aaaagagatg ggaaattcct agtgagattg 
ctccactaag cattattgtg aagcctgatt 
tgaatgagac tcttagggat tctggacctg 
aatcgttttt cagtagtgat gaagataagg 
cagaggaaga tgctgaagag gaataatgt'g 
ctctttcaat tttcaatcct gccttttaat 
gggaagcatt cttttgcttc ctcaaactgg 
acaatatgaa taaggttttg tcttccggca 
agatgcggca gcggccgctc tagagtatcc 
ctttcttgta caaagtggtg atatcacaag 
attatatccc tctagatcac aagcccgggc 
tcacgcatca ccatgggtgg cagtgtcagt 
tgaaaagaaa aaaagtactc catctgttcc 
gtttatacaa taattgatat atgttttctg 
tcttctaggg ataacagggt aattatatcc 
aaacaaataa tattaatttg agaatgaaca 
catccatttc catttcacag ttcgatagcg 
agcacaacaa atggtacaag aaaaacagtt 



ctgctaataa 
caattcagtc 
agagaaatga 
gcaagaattt 
tagttcatgg 
tgcttccgcg 
ctaaaagaat 
agccacctgt 
taataaaggg 
ttttgttcat 
tattctagct 
ggttgtccaa 
ctcgaggggc 
cccgggcggt 
ggtcttctac 
gtgagcaatg 
aaattaaaat 
tatatgtcta 
ctctagacaa 
aaaggaccat 
aaaaccgaat 
ttcccaatgc 



1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3074 



<210> 14 

<211> 668 

<212> DNA 

<213> Oryza sativa 



<220> 
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<221> misc_feature 

<223> prolamin RP6 promter sequence 



cc^ctacat cggcttaggt gtagcaacac gactttatta ttattattat tattattatt 
attattttac aaaaatataa aatagatcag tccctcacca caagtagagc aagttggtga 
gttattgtaa agttctacaa agctaattta aaagttattg cattaactta tttcatatta 
caaacaagag tgtcaatgga acaatgaaaa ccatatgaca tactataatt ttgtttttat 
tattgaaatt atataattca aagagaataa atccacatag ccgtaaagtt ctacatgtgg 
tgcattacca aaatatatat agcttacaaa acatgacaag cttagtttga aaaattgcaa 
tccttatcac attgacacat aaagtgagtg atgagtcata atattatttt tcttgctacc 
catcatgtat atatgatagc cacaaagtta ctttgatgat gatatcaaag aacattttta 
ggtgcaccta acagaatatc caaataatat gactcactta gatcataata gagcatcaag 
taaaactaac actctaaagc aaccgatggg aaagcatcta taaatagaca agcacaatga 
aaatcctcat catccttcac cacaattcaa atattatagt tgaagcatag tagtagaatc 
caacaaca 



<210> 15 

<211> 7 

<212> PRT 

<213> Artificial sequence 



180 

240 

300 

360 

420 

480 

540 

600 

660 

668 



<220> 

<223> Motif 1 CORE SEQUENCE 
<220> 

<221> MI S COFEATURE 

<222> (2) . . (2) 

<223> Xaa can be any amino acid 



<220> 

<221> MISC FEATURE 
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<222> (5) . . (6) 

<223> Xaa can be any amino acid 
<400> 15 

Trp Xaa Asn Ala Xaa Xaa Asp 
^ 5 

<210> 16 

<211> 6 

<212> PRT 

<213> Artificial sequence 

Motif 2 CORE SEQUENCE 

MISCJFEATURE 
(4) . . (5) 

Xaa can be any amino acid 
<400> 16 

Lys Glu Asn Xaa Xaa Pro 
1 5 

<210> 17 

<211> 15 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Motif 3 (coiled coil) CORE SEQUENCE 
<220> 



<220> 
<223> 
<220> 
<221> 
<222> 
<223> 
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<221> MISCJFEATURE 

<222> (2) (2) 

<223> Xaa can be a stretch 



105-Seedyl-EP . ST25 .txt 



of 1 to 6 amino acids 



<220> 

<221> MI S COFEATURE 

<222> (4).. (5) 

<223> Xaa can be any amino acid 



<220> 

<221> MIS COFEATURE 

<222> (8) . - (10) 

<223> Xaa can be any amino acid 



<220> 

<221> MI S COFEATURE 

<222> (12) (13) 

<223> Xaa can be any amino acid 



<400> 17 

Glu Xaa Glu Xaa Xaa Arg Leu Xaa Xaa Xaa Leu Xaa Xaa Leu 
x 5 10 



<210> 18 

<211> 15 

<212> PRT 

<213> Artificial sequence 



<220> 

<223> Motif 4 CORE SEQUENCE 
<220> 
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<221> MISC_FEATURE 
<222> (3) . . (3) 

<223> Xaa can be any amino acid 

MISC_FEATURE 
(5) (5) 

Xaa can be a stretch of 1 to 10 amino acids 



<220> 
<221> 
<222> 
<223> 



<220> 

<221> MIS COFEATURE 

<222> (10) (11) 

<223> Xaa can be any amino acid 



<220> 

<221> MIS COFEATURE 

<222> (14) (14) 

<223> Xaa can be a stretch of 1 to 6 amino acids 



<400> 18 



Leu Pro Xaa He Xaa Arg Asp Ser Gly Xaa Xaa Lys Arg Xaa Lys 



10 15 
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